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POLYNUCLEOTIDE AND ITS USE FOR MODULATING A DEFENCE RESPONSE IN PLANTS 

The present invention relates to stimulating a defence 
response in plants, with a view to providing the plants with 
5 enhanced pathogen resistance. More specifically, it has 

resulted from cloning of the barley Mlo gene, various mutant 
mlo alleles, and a number of homologues from various species. 
The Mlo gene has been isolated using a positional cloning 
approach which has never previously been successful in Barley. 

.10 Details and discussion are provided below. Wild- type Mlo 

exerts a negative regulatory function on a pathogen defence 
response, such that mutants exhibit a defence response in the 
absence of pathogen. In accordance with the present invention, 
down -regulation or out -competition of Mlo function may be used 

15 to stimulate a defence response in transgenic plants, 
conferring increased pathogen resistance. 

Mutations have been described in several plants in which 
defence responses to pathogens appear to be constitut ively 
expressed. Mutation -induced recessive alleles {mlo) of the 

2 0 barley Mlo locus exhibit a leaf, lesion phenotype and confer an 
apparently durable, broad spectrum resistance to the powdery 
mildew pathogen, Erysiphe graminis f sp horde i . 

Resistance responses to the powdery mildew pathogen have 
been genetically well characterized (Wiberg, 1974; Sagaard and 

25 J0rgensen, 1988; J0rgensen, 1994) . In most analyzed cases 
resistance is specified by race-specific resistance genes 
following the rules of Flor's gene-f or-gene hypothesis (Flor, 
1971) . In this type of plant /pathogen interaction, resistance 
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is specified by and dependent on the presence of two 
complementary genes, one from the host and one from the fungal 
pathogen. The complementary genes have been termed 
operationally (pathogen) resistance i n R") gene and avirulence 
gene, respectively. Most of the powdery mildew resistance genes 
(Mix) act as dominant or semidominant traits (Jergensen, 1994) . 

Monogenic resistance mediated by recessive (mlo) alleles 
of the Mlo locus is different. Apart from being recessive, it 
differs from race-specific resistance to single pathogen 
strains in that (i) it confers broad spectrum resistance to . 
almost all known isolates of the pathogen (ii) mlo resistance 
alleles have been obtained by mutagen treatment of any tested 
susceptible wild type (Mlo) variety, and (iii) mlo resistance 
alleles exhibit a defence mimic phenotype in the absence of the 
pathogen (Wolter et al. , 1993). Thus, the genetic data 
indicate the Mlo wild type allele exerts a negative regulatory 
function on defence responses to pathogen attack. 

Resistance mediated by mlo alleles is currently widely 
used in barley breeding and an estimated 10 million hectares 
are annually planted in Europe with seeds of this genotype. A 
x mlo like' inherited resistance to powdery mildew in other 
cereal plants has not been reported so far although the fungus 
is a relevant pathogen in wheat (attacked by Erysiphe graminis 
f sp tritici), oat (attacked by E. g. f sp avenae) , and rye 
(attacked by E . g. f sp secalis) . Because cereals are 
morphologically, genetically and biochemically highly related 
to each other (Moore et al . , 1995), one would predict the 
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existence of homologous genes in these species. The failure to 
have found a y mlo like' inherited resistance in wheat and oat 
is probably due to their hexaploid genomes, making it difficult 
to obtain by mutagenesis defective alleles in all six gene 
copies, and the chance of all such mutations occurring in 
Nature is remote. The failure to have found a mlo equivalent 
in other cereals is probably due to insignificant amount of 
mutational analysis in these species and complications as a 
result of their outbreeding nature (e.g. rye). 

RFLP markers closely linked to Mlo on barley chromosome 4 
were previously identified on the basis of a mlo backcross line 
collection containing mlo alleles from six genetic backgrounds 
(Hinze et ai. , 1991) . The map position of Mlo on the basis of 
RFLP markers was consistent with its chromosomal localization 
as determined by a previous mapping with morphological markers 
(Jargensen, 1977) . 

Having identified an -3cM genetic interval containing Mlo 
bordered by genetic markers, we decided to attempt to isolate 
the gene via positional cloning. 

However, there is no documented example of a successful 
positional cloning attempt of a barley gene. We were faced 
with a number of difficulties. 

9 

Firstly, the genome of barley (5.3x10 bp/haploid genome 
equivalent; Bennett and Smith, 1991) has almost double the size 
of the human genome and because the total genetic map covers 
-1.800 cM (Becker et al . , 1995) we were confronted with a very 
unfavourable ratio of genetic and physical distances (1 cM 
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corresponds to - 3 Mb) . 

Secondly, a high resolution genetic map had to be 
constructed around Mlo enabling the positioning of linked 
markers with a precision of better than 0.1 cM. 

Thirdly, we aimed to physically delimit the target gene 
and both flanking DNA markers on individual large insert 
genomic clones, a procedure later termed "chromosome landing" 
(Tanksley et al . , 1995). For this purpose, a complete barley 
YAC library from barley Megabase DNA had to be constructed with 
an average insert size of 500-600 kb, which was unprecedented. 

Fourthly, we had to prepare unusual genetic tools that 
enabled us to identify the Mlo gene within a physically 
delimited region without the need for a time consuming 
generation of barley transgenic plants and testing of different 
candidate genes. We used for our studies ten characterized 
radiation- or chemically-induced mlo mutants (Jorgensen, 1992). 
For a conclusive chain of evidence of the gene isolation we 
decided to depend upon a functional restoration of the wild 
type Mlo allele starting out from characterized mlo defective 
alleles. For this purpose, we performed mlo heteroallelic 
crosses and isolated susceptible intragenic Mlo recombinants. 
The sequence analysis of these proves the function of the 
described gene. 

The cloning of the barley Mlo gene and homologues, 
including homologues from other plant species, gives rise to a 
number of practical applications, reflected in the various 
aspects of the present invention. 
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According to a first aspect of the present invention there 
is provided a nucleic acid molecule comprising a nucleotide 
se q Uence encoding a peptide with Mlo function. Those skilled in 
the art will appreciate that "Mlo function" refers to the 
ability to suppress a defence response, said defence response 
being race and/or pathogen independent and autonomous of the 
presence of a pathogen, such as, for example, the Mlo gene of 
barley, the Acd gene and the Lsd gene of Arabidopsis . 

mlo mutations that down- regulate or disrupt functional 
expression of the wild- type Mlo sequence are recessive, such 
that they are complemented by expression of a wild-type 
sequence. Thus "Mlo function" can be determined by assessing 
the level of constitutive defence response and/or 
susceptibility of the plant to a pathogen such as, for example, 
powdery mildew or rust (e.g. yellow rust) . Accordingly, a 
putative nucleotide sequence with Mlo function can be tested 
upon complementation of a suitable mlo mutant. The term "mlo 

function" is used to refer to sequences which confer a mlo 
mutant phenotype on a plant. 

The capitalisation of "Mlo" and non-capitalisation of 

"mlo" is thus used to differentiate between "wild- type" and 

"mutant " function . 

A mlo mutant phenotype is characterised by the exhibition 

of an increased resistance against one or more pathogens, which 

is race and/or pathogen independent and autonomous of the 

presence of a pathogen. 

The test plant may be monocotyledonous or dicotyledonous . 
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Suitable monocots include any of barley, rice, wheat, maize or 
oat, particularly barley. Suitable dicots include Arabidopsis . 

Nucleic acid according to the invention may encode a 
polypeptide comprising the amino acid sequence shown in Figure 
2, or an allele, variant, derivative or mutant, or homologue, 
thereof . 

Nucleic acid according to the present invention may have 
the sequence of a Mlo gene of barley, or be a mutant, variant 
(or derivative) or allele of the sequence provided, or a 
homologue thereof. Preferred mutants, variants and alleles are 
those which encode a sequence which retains a functional 
characteristic of the wild-type gene, especially the ability to 
suppress a defence response as discussed herein. Other 
preferred mutants, variants and alleles encode a sequence 
which, in a homozygote, cause constitutive activation of a 
defence response, or at least promotes activation of a defence 
response (i.e. is a mlo mutant sequence), e.g. by reducing or 
wholly or partly abolishing Mlo function. Preferred mutations 
giving mlo mutant sequences are shown in Table 1 . Changes to a 
sequence, to produce a mutant, derivative or variant, may be by 
one or more of addition, insertion, deletion or substitution of 
one or more nucleotides in the nucleic acid, leading to the 
addition, insertion, deletion and/or substitution of one or 
more amino acids. Of course, changes to the nucleic acid which 
make no difference to the encoded amino acid sequence are 
included. Particular variants, mutants, alleles and 
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derivatives are discussed further below, as well as homologues . 

A preferred nucleic acid sequence according to an aspect 
of the present invention is shown in Figure 2 along with the 
predicted amino acid sequence. Nucleic acid may be subject to 
5 alteration by way of substitution of nucleotides and/or a 

combination of addition, insertion and/or substitution of one 
or more nucleotides with or without altering the encoded amino 
acids sequence (by virtue of the degeneracy of the genetic 
code)-. 

10 As discussed below, further aspects of the present 

invention provide homologues of the Mlo sequence shown in 
Figure 2, including from rice (genomic sequence Figure 5, 
bottom line, cDNA sequence Figure 10, amino acid sequence 
Figure 13) and barley (genomic sequence Figure 6, bottom line, 

15 cDNA sequence Figure 11, amino acid sequence Figure 14) ; also 
Table 5B (nucleotide sequences) and Figure 5A (amino acid 
sequences) show homologous EST" s from rice and Arabidopsis . 

The present invention also provides a vector which 
comprises nucleic acid with any one of the provided sequences, 

20 preferably a vector from which a product can be expressed. The 
vector is preferably suitable for transformation into a plant 
cell and/or a microbial cell. The invention further encompasses 
a host cell transformed with such a vector, especially a plant 
cell or a microbial cell (e.g. Agrobacterium tuwefaciens) . 

25 Thus, a host cell, such as a plant cell, comprising nucleic 

acid according to the present invention is provided. Within the 
ceil, the nucleic acid may be incorporated within the nuclear 
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genome, i.e. a chromosome. There may be more than one 
heterologous nucleotide sequence per haploid genome. 

A vector comprising nucleic acid according to the present 
invention need not include a promoter, particularly if the 
vector is to be used to introduce the nucleic acid into cells 
for recombination into the genome. 

Nucleic acid molecules and vectors according to the 
present invention may be provided in a form isolated and/or 
purified from their natural environment, in substantially pure 
or homogeneous form, or free or substantially free of nucleic 
acid or genes of the species of interest or origin other than 
the relevant sequence. Nucleic acid according to the present 
invention may comprise cDNA, RNA, genomic DNA and may be wholly 
or partially synthetic. The term "isolate" may encompass all 
these possibilities . 

The present invention also encompasses the expression 
product of any of the nucleic acid sequences disclosed and 
methods of making the expression product by expression from 
encoding nucleic acid therefore under suitable conditions in 
suitable host cells, e.g. E. coli. Those skilled in the art 
are well able to construct vectors and design protocols for 
expression and recovery of products of recombinant gene 
expression. Suitable vectors can be chosen or constructed, 
containing one or more appropriate regulatory sequences, 
including promoter sequences, terminator fragments, 
polyadenylation sequences, enhancer sequences, marker genes and 
other sequences as appropriate. For further details see, for 
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example, Molecular Cloning: a Laboratory Manual: 2nd edition, 
Sambrook et al, 1989, Cold Spring Harbor Laboratory Press. 
Transformation procedures depend on the host used, but are well 
known. Many known techniques and protocols for manipulation of 
nucleic acid, for example in preparation of nucleic acid 
constructs, mutagenesis, sequencing, introduction of DNA into 
cells and gene expression, and analysis of proteins, are 
described in detail in Short Protocols in Molecular Biology, 
Second Edition, Ausubel et al . eds . , John Wiley & Sons, 1992. 
The disclosures of Sambrook et al . and Ausubel et al . are 
incorporated herein by reference, along with all other 
documents mentioned. 

Purified Mlo protein, or a fragment, mutant or variant 
thereof, e.g. produced recombinantly by expression from 
encoding nucleic acid therefor, may be used to raise antibodies 
employing techniques which are standard in the art. Antibodies 
and polypeptides comprising ant igen- binding fragments of 
antibodies may be used in identifying homologues from other 
species as discussed further below. 

Methods of producing antibodies include immunising a 
mammal (eg human, mouse, rat, rabbit, horse, goat, sheep or 
monkey) with the protein or a fragment thereof. Antibodies may 
be obtained from immunised animals using any of a variety of 
techniques known in the art, and might be screened, preferably 
using binding of antibody to antigen of interest. For 
instance, Western blotting techniques or immunoprecipitat ion 
may be used (Armitage et al, 1992, Nature 357: 80-82) . 
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Antibodies may be polyclonal or monoclonal. 

As an alternative or supplement to immunising a mammal, 
antibodies with appropriate binding specificity may be obtained 
from a recombinant ly produced library of expressed 
immunoglobulin variable domains, eg using lambda bacteriophage 
or filamentous bacteriophage which display functional 
immunoglobulin binding domains on their surfaces; for instance 
see WO92/01047 . 

Antibodies raised to a polypeptide or peptide can be used 
in the identification and/or isolation of homologous 
polypeptides, and then the encoding genes. Thus, the present 
invention provides a method of identifying or isolating a" 
polypeptide with Mlo or mlo function (in accordance with 
embodiments disclosed herein) , comprising screening candidate 
peptides or polypeptides with a polypeptide comprising the 
antigen-binding domain of an antibody (for example whole 
antibody or a fragment thereof) which is able to bind an Mlo or 
mlo peptide, polypeptide or fragment, variant or variant 
thereof or preferably has binding specificity for such a 
peptide or polypeptide, such as having an amino acid sequence 
identified herein. Specific binding members such as antibodies 
and polypeptides comprising antigen binding domains of 
antibodies that bind and are preferably specific for a Mlo or 
mlo peptide or polypeptide or mutant, variant or derivative 
thereof represent further aspects of the present invention, as 
do their use and methods which employ them. 

Candidate peptides or polypeptides for screening may for 
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instance be the products of an expression library created using 
nucleic acid derived from an plant of interest, or may be the 
product of a purification process from a natural source. 

A peptide or polypeptide found to bind the antibody may be 
5 isolated and then may be subject to amino acid sequencing. Any 
suitable technique may be used to sequence the peptide or 
polypeptide either wholly or partially (for instance a fragment 
of a polypeptide may be sequenced) . Amino acid sequence 
information may be used in obtaining nucleic acid encoding the 

10 peptide or polypeptide, for instance by designing one or more 
oligonucleotides (e.g. a degenerate pool of oligonucleotides) 
for use as probes or primers in hybridisation to candidate 
nucleic acid, or by searching computer sequence databases, as 
discussed further below. 

15 A further aspect of the present invention provides a 

method of identifying and cloning Mlo homologues from plants, 
including species other than Barley, which method employs a 
nucleotide sequence derived from that shown in Figure 2 . 
Further similar aspects employ a nucleotide sequence derived 

2 0 from any of the other Figures provided herein. Nucleic acid 

libraries may be screened using techniques well known to those 
skilled in the art and homologous sequences thereby identified 
then tested. The provision of sequence information for the Mlo 
gene of Barley and various homologues enables the obtention of 

25 homologous sequences from Barley and other plant species, as 
exemplified further herein. 

Also, one can easily derive PCR primers based on putative 
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exon sequences, which might be identified by comparison with 
the Mlo sequence provided in Figure 2 wherein exons are 
highlighted, and perform RT-PCR with total RNA from the plant 
of interest, e.g. barley and rice for the homologues shown in 
5 Figures 5 and 6, with cDNA and amino acid sequences shown in 
other figures herein. 

The homologues whose nucleotide sequences are given and 
whose amino acid sequences are given or are deducible represent 
and provide further aspects of the present invention in 
10 accordance with those disclosed for the Barley gene shown in 
Figure 2 . 

The present invention also extends to nucleic acid 
encoding a Mlo homologue obtained using a nucleotide sequence 
derived from that shown in Figure 2, or the amino acid sequence 

15 shown in Figure 2. Preferably, the nucleotide sequence and/or 
amino acid sequence shares homology with the sequence encoded 
by the nucleotide sequence of Figure 2, preferably at least 
about 50%, or at least about- 55%, or at least about 60%, or at 
least about 65%, or at least about 70%, or at least about 75%, 

20 or at least about 80% homology, or at least about 85% homology, 
or at least about 90% homology, most preferably at least about 
95% homology. "Homology" in relation to an amino acid sequence 
may be used to refer to identity or similarity, preferably 
identity. High levels of amino acid identity may be limited to 

25 functionally significant domains or regions. 

A mutant, allele, variant or derivative amino acid 
sequence in accordance . with the present invention may include 
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within the sequence shown in Figure 2, a single amino acid 
change with respect to the sequence shown in Figure 2, or 2; 3, 
4, 5, 6, 7, 8, or 9 changes, about 10, 15, 20, 30, 40 or 50 
changes, or greater than about 50, 60, 70, 80 or 90 changes. 
5 In addition to one or more changes within the amino acid 
sequence shown in Figure 2, a mutant, allele, variant or 
derivative amino acid sequence may include additional amino 
acids at the C- terminus and/or N- terminus. 

As is well -understood, homology at the amino acid level is 

10 generally in terms of amino acid similarity or identity. 
Similarity allows for "conservative variation", i.e. 
substitution of one hydrophobic residue such as isoleucine, 
valine, leucine or methionine for another, or the substitution 
of one polar residue for another, such as arginine for lysine, 

15 glutamic for aspartic acid, or glutamine for asparagine . 

Similarity may be as defined and determined by the TBLASTN 
program, of Altschul et al . (1990) J. Mol . Biol. 215; 403-10, 
which is in standard use in the art, or, and this may be 
preferred, the standard program BestFit, which is part of the 

20 Wisconsin Package, Version 8, September 1994, (Genetics 

Computer Group, 575 Science Drive, Madison, Wisconsin, USA, 
Wisconsin 53711) . BestFit makes an optimal alignment of the 
best segment of similarity between two sequences. Optimal 
alignments are found by inserting gaps to maximize the number 

25 of matches using the local homology algorithm of Smith and 
Waterman 

Homology may be over the full-length of the relevant 
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sequence shown herein, or may more preferably be over a 
contiguous sequence of about or greater than about 20, 25, 30, 
33, 40, 50, 67, 133, 167, 200, 233, 267, 300, 333, 400, 450, 
500, 550, 600 or more amino acids or codons, compared with the 
5 relevant amino acid sequence or nucleotide sequence as the case 
may be . 

The EST sequences provided herein, have on average 70% 
similarity and 50% identity with the Mlo amino acid sequence of 
Figure 2. We show that the rice homologue (Figure 5) and 

10 barley homologue (Figure 6) have an amino acid identity of 81% 
(amino acid sequences shown in Figure 13 and Figure 14) . 

In certain embodiments, an allele, variant, derivative, 
mutant or homologue of the specific sequence may show little 
overall homology, say about 2 0%, or about 2 5%, or about 3 0%, or 

15 about 35%, or about 40% or about 45%, with the specific 

sequence. However, in functionally significant domains or 
regions the amino acid homology may be much higher. Putative 
functionally significant domains or regions can be identified 
using processes of bioinf ormatics , including comparison of the 

20 sequences of homologues. Functionally significant domains or 
regions of different polypeptides may be combined for 
expression from encoding nucleic acid as a fusion protein. For 
example, particularly advantageous or desirable properties of 
different homologues may be combined in a hybrid protein, such 

25 that the resultant expression product, with Mlo or mlo 

function, may comprise fragments of various parent proteins. 
The nucleotide sequence information provided herein, or 
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any part thereof, may be used in a data-base search to find 
homologous sequences, expression products of which can be 
tested for Mlo or mlo function. These may have ability to 
complement a mlo mutant phenotype in a plant or may, upon 
5 expression in a plant, confer a mlo phenotype. 

In public sequence databases we recently identified 
several homologues for the sequence of Figure 2 . We have 
already found homologues in rice and barley, and the dicot . 
Arabidopsis . 

10 By sequencing homologues, studying their expression • 

patterns and examining the effect of altering their expression, 
genes carrying out a similar function to Mlo in Barley are 
obtainable. Of course, mutants, variants and alleles of these 
sequences are included within the scope of the present 

15 invention in the same terms as discussed above for the Barley 
gene . 

Homology between the homologues as disclosed herein, may 
be exploited in the identification of further homologues, for 
example using oligonucleotides (e.g. a degenerate pool) 

20 designed on the basis of sequence conservation. 

According to a further aspect, the present invention 
provides a method of identifying or a method of cloning a Mlo 
homologue, e.g. from a species other than Barley, the method 
employing a nucleotide sequence derived from that shown in 

25 Figure 2 or that shown in any of the other Figures herein. For 
instance, such a method may employ an oligonucleotide or 
oligonucleotides which comprises or comprise a sequence or 
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sequences that are conserved between the sequences of Figures 2 
and/or 5 and/or 6 and/or 10 and/or 11 and/or 12, or encoding an 
amino acid sequence conserved between Figure 2 and/or 7 and/or 
13 and/or 14 and/or 15 to search for homologues. Thus, a 
method of obtaining nucleic acid is provided, comprising 
hybridisation of an oligonucleotide or a nucleic acid molecule 
comprising such an oligonucleotide to target /candidate nucleic 
acid. Target or candidate nucleic acid may, for example, 
comprise a genomic or cDNA library obtainable from an organism 
known to contain or suspected of containing such nucleic acid, 
either monocotyledonous or dicotyledonous. Successful 
hybridisation may be identified and target /candidate nucleic 
acid isolated for further investigation and/or use. 

Hybridisation may involve probing nucleic acid and 
identifying positive hybridisation under suitably stringent 
conditions (in accordance with known techniques) and/or use of 
oligonucleotides as primers in a method of nucleic acid 
amplification, such as PCR. For probing, preferred conditions 
are those which are stringent enough for there to be a simple 
pattern with a small number of hybridisations identified as 
positive which can be investigated further. It is well known 
in the art to increase stringency of hybridisation gradually 
until only a few positive clones remain. 

As an alternative to probing, though still employing 
nucleic acid hybridisation, oligonucleotides designed to 
amplify DNA sequences may be used in PCR reactions or other 
methods involving amplification of nucleic acid, using routine 
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procedures. See for instance M PCR protocols; A Guide to 
Methods and Applications " , Eds. Innis et al , 1990, Academic 
Press, New York. 

Preferred amino acid sequences suitable for use in the 
design of probes or PCR primers for some purposes are sequences 
conserved (completely, substantially or partly) between at 
least two Mlo peptides or polypeptides encoded by genes able to 
suppress a defence response in a plant, e.g. with any of the 
amino acid sequences of any of the various figures herein 
and/or encoded by the nucleotide sequences of any of the 
various figures herein. 

On the basis of amino acid sequence information 
oligonucleotide probes or primers may be designed, taking into 
account the degeneracy of the genetic code, and, where 
appropriate, codon usage of the organism from the candidate 
nucleic acid is derived. 

Preferably an oligonucleotide in accordance with certain 
embodiments of the invention, e.g. for use in nucleic acid 
amplification, is up to about 50 nucleotides, or about 40 
nucleotides or about 30 or fewer nucleotides in length (e.g. 
18, 21 or 24) . 

Assessment of whether or not such a PCR product 
corresponds to Mlo homologue genes may be conducted in various 
ways. A PCR band from such a reaction might contain a complex 
mix of products. Individual products may be cloned and each 
one individually screened. It may be analysed by 
transformation to assess function on introduction into a plant 
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of interest . 

As noted, nucleic acid according to the present invention 
is obtainable using oligonucleotides, designed on the basis of 
sequence information provided herein, as probes or primers. 
Nucleic acid isolated and/or purified from one or more cells of 
barley or another plant (see above) , or a nucleic acid library- 
derived from nucleic acid isolated and/or purified from the 
plant (e.g. a cDNA library derived from mRNA isolated from the 
plant) , may be probed under conditions for selective 
hybridisation and/or subjected to a specific nucleic acid 
amplification reaction such as the polymerase chain reaction 
(PGR) . The nucleic acid probed or used as template in the 
amplification reaction may be genomic DNA, cDNA or RNA. If 
necessary, one or more gene fragments may be ligated to 
generate a full-length coding sequence. 

We have tested several PCR primers derived from the Mlo 
sequence disclosed herein to test their specificity for 
amplifying nucleic acid according to the present invention, 
using both barley genomic DNA and RT-PCR templates. The latter 
was synthesized from barley polyA + RNA. In each case we were 
able to amplify the expected Mlo derived gene fragments as 
shown by cloning and subsequent DNA sequencing of the PCR 
products. Full length cDNA clones can be obtained as described 
by 5' and 3' RACE technology if RT-PCR products are used as 
templates . 

Examples of primers tested include: 
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25L 


5' 


-GTG 


CAT 


CTG 


CGT 


GTG 


CGT 


A- 3 ' 




25LN 


5' 


-GTG 


TGC 


GTA 


CCT 


GGT 


AGA 


G-3 ' 




25R 


5' 


-AAC 


GAC 


GTC 


TGG 


TGC 


GTG- 


-3 ' 




33 


5' 


-TGC 


AGC 


TAT 


ATG 


ACC 


TTC 


CCC 


CTC 


37 


5' 


-GGA 


CAT 


GCT 


GAT 


GGC 


TCA 


GA-3 ' 


38 


5' 


-GAG 


AAC 


TTG 


TCT 


CAT 


CCC 


TG-3 




3 8A 


5' 


-GGC 


TAT 


ACA 


TTG 


GGA 


CTA 


ACA- 


3' 


3 8B 


5' 


-CGA 


ATC 


ATC 


ACA 


TCC 


TAT 


GTT- 


3' 


39 


5' 


-GCA 


AGT 


TCG 


ACT 


TCC 


AC- 3' 




3 9A 


5' 


-TCG 


ACT 


TCC 


ACA 


AGT 


ACA 


TCA- 


3' 


53 


5' 


-AGC 


GTA 


CCT 


GCG 


TAC 


GTA 


G-3' 





Various primer combinations have been tested: 
38/39A; 38/39; 38/33; 38/37; 38A/39A; 38B/39A; 38/25L; 38/25LN; 
25R/25L; 25R/25LN; 25R/53 . 

15 

Various aspects of the present invention include the 
obtainable nucleic acid, methods of screening material, e.g. 
cell lysate, nucleic acid preparations, for the presence of 
nucleic acid of interest, methods of obtaining the nucleic 
2 0 acid, and the primers and primer combinations given above. 



The sequence information provided herein also allows the 
design of diagnostic tests for determination of the presence of 
a specific mlo resistance allele, or a susceptibility allele 
25 (e.g. wild-type), in any given plant, cultivar, variety, 

population, landrace, part of a family or other selection in a 
breeding programme or other such genotype. A diagnostic test 
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may be based on determination of the presence or absence of a 
particular allele by means of nucleic acid or polypeptide 
determination . 

At the nucleic acid level, this may involve hybridisation 
of a suitable oligo- or poly-nucleot ide , such as a fragment of 
the Mlo gene or a homologue thereof, including any homologue 
disclosed herein, or any particular allele, such as an allele 
which gives an mlo phenotype, such as any such allele disclosed 
herein. The hybridisation may involve PCR designed to amplify 
a product from a given allelic version of mlo, with subsequent 
detection of an amplified product by any of a number of 
possible methods including but not limited to gel 
electrophoresis, capillary electrophoresis, direct 
hybridisation of nucleotide sequence probes and so on. A 
diagnostic test may be based on PCR designed to amplify various 
alleles or any allele from the Mlo locus, with a test to 
distinguish the different possible alleles by any of a number 
of possible methods, including DNA fragment size, restriction 
site variation (e.g. CAPS - cleaved amplified polymorphic 
sites) and so on. A diagnostic test may also be based on a 
great number of possible variants of nucleic acid analysis that 
will be apparent to those skilled in the art, such as use of a 
synthetic mlo-derived sequence as a hybridisation probe. 

Broadly, the methods divide into those screening for the 
presence of nucleic acid sequences and those that rely on 
detecting the presence or absence of a polypeptide. The 
methods may make use of biological samples from one or more 
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plants or cells that are suspected to contain the nucleic acid 
sequences or polypeptide. 

Exemplary approaches for detecting nucleic acid or 
polypeptides include analysing a sample from the plant or plant 
5 cell by: 

(a) comparing the sequence of nucleic acid in the sample 
with all or part of the nucleotide sequence shown in Figure 7 
to determine whether the sample from the patient contains a 
mutation; 

10 (b) determining the presence in the sample of a 

polypeptide including the amino acid sequence shown in Figure 2 
or a fragment thereof and, if present, determining whether the 
polypeptide is full length, and/or is mutated, and/or is 
expressed at the normal level; 

15 (c) performing DNA fingerprinting to compare the 

restriction pattern produced when a restriction enzyme cuts 
nucleic acid in the sample with the restriction pattern 
obtained from the nucleotide sequence shown in Figure 7 or from 
a known mutant, allele or variant thereof; 

20 (d) contacting the sample with a specific binding member 

capable of binding to nucleic acid including the nucleotide 
sequence as set out in Figure 7 or a fragment thereof, or a 
mutant, allele or variant thereof, the specific binding member 
including nucleic acid hybridisable with the sequence of Figure 

25 7 or a polypeptide including a binding domain with specificity 
for nucleic acid including the sequence of Figure 7 or the 
polypeptide encoded by it, or a mutated form thereof, and 



WO 98/04586 



PCT/GB97/02U46 



22 

determining binding of the specific binding member; 

(e) performing PCR involving one or more primers based on 
the nucleotide sequence shown in Figure 7 to screen the sample 
for nucleic acid including the nucleotide sequence of Figure 7 
or a mutant, allele or variant thereof. 

When screening for a resistance allele nucleic acid, the 
nucleic acid in the sample will initially be amplified, e.g. 
using PCR, to increase the amount of the analyte as compared to 
other sequences present in the sample . This allows the target 
sequences to be detected with a high degree of sensitivity if 
they are present in the sample. This initial step may be 
avoided by using highly sensitive array techniques that are 
becoming increasingly important in the art. 

A variant form of the gene may contain one or more 
insertions, deletions, substitutions and/or additions of one or 
more nucleotides compared with the wild-type sequence (such as 
shown in Table 1) which may or may not disrupt the gene 
function. Differences at the nucleic acid level are not 
necessarily reflected by a difference in the amino acid 
sequence of the encoded polypeptide. However, a mutation or 
other difference in a gene may result in a frame-shift or stop 
codon, which could seriously affect the nature of the 
polypeptide produced (if any) , or a point mutation or gross 
mutational change to the encoded polypeptide, including 
insertion, deletion, substitution and/or addition of one or 
more amino acids or regions in the polypeptide. A mutation in 
a promoter sequence or other regulatory region may prevent or 
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reduce expression from the gene or affect the processing or 
stability of the mRNA transcript. 

Tests may be carried out on preparations containing 
genomic DNA, cDNA and/or mRNA . Testing cDNA or mRNA has the 
5 advantage of the complexity of the nucleic acid being reduced 
by the absence of intron sequences, but the possible 
disadvantage of extra time and effort being required in making 
the preparations. RNA is more difficult to manipulate than DNA 
because of the wide -spread occurrence of RN'ases. 

10 Nucleic acid in a test sample may be sequenced and the 

sequence compared with the sequence shown in Figure 2, or other 
figure herein, to determine whether or not a difference is 
present. If so, the difference can be compared with known 
susceptibility alleles (e.g. as summarised in Table 1) to 

15 determine whether the test nucleic acid contains one or more of 
the variations indicated, or the difference can be investigated 
for association with disease resistance. 

The amplified nucleic acid may then be sequenced as above, 
and/or tested in any other way to determine the presence or 

20 absence of a particular feature. Nucleic acid for testing may 
be prepared from nucleic acid removed from cells or in a 
library using a variety of other techniques such as restriction 
enzyme digest and electrophoresis. 

Nucleic acid may be screened using a variant- or allele- 

25 specific probe. Such a probe corresponds in sequence to a 

region of the gene, or its complement, containing a sequence 
alteration known to be associated with disease resistance. 
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Under suitably stringent conditions, specific hybridisation of 
such a probe to test nucleic acid is indicative of the presence 
of the sequence alteration in the test nucleic acid. For 
efficient screening purposes, more than one probe may be used 
5 on the same test sample. 

Allele- or variant - specif ic oligonucleotides may similarly 
be used in PCR to specifically amplify particular sequences if 
present in a test sample. Assessment of whether a PCR band 
contains a gene variant may be carried out in a number of ways 

10 familiar to those skilled in the art. The PCR product may for 
instance be treated in a way that enables one to display the 
mutation or polymorphism on a denaturing polyacrylamide DNA 
sequencing gel, with specific bands that are linked to the gene 
variants being selected. 

15 An alternative or supplement to looking for the presence 

of variant sequences in a test sample is to look for the 
presence of the normal sequence, e.g. using a suitably specific 
oligonucleotide probe or primer. 

Approaches which rely on hybridisation between a probe and 

20 test nucleic acid and subsequent detection of a mismatch may be 
employed. Under appropriate conditions (temperature, pH etc.), 
an oligonucleotide probe will hybridise with a sequence which 
is not entirely complementary. The degree of base-pairing 
between the two molecules will be sufficient for them to anneal 

25 despite a mis-match. Various approaches are well known in the 
art for detecting the presence of a mis-match between two 
annealing nucleic acid molecules. 
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For instance, RN'ase A cleaves at the site of a mis-match. 
Cleavage can be detected by electrophoresing test nucleic acid 
to which the relevant probe or probe has annealed and looking 
for smaller molecules (i.e. molecules with higher 
5 electrophoretic mobility) than the full length probe/test 

hybrid. Other approaches rely on the use of enzymes such as 
resolvases or endonucleases . 

Thus, an oligonucleotide probe that has the sequence of a 
region of the normal gene {either sense or anti-sense strand) 

10 in which mutations associated with disease resistance are known 
to occur (e.g. see Table 1) may be annealed to test nucleic 
acid and the presence or absence of a mis-match determined. 
Detection of the presence of a mis-match may indicate the 
presence in the test nucleic acid of a mutation associated with 

15 disease resistance. On the other hand, an oligonucleotide 

probe that has the sequence of a region of the gene including a 
mutation associated with disease resistance may be annealed to 
test nucleic acid and the presence or absence of a mis-match 
determined. The presence of a mis -match may indicate that the 

20 nucleic acid in the test sample has the normal sequence, or a 

different mutant or allele sequence. In either case, a battery 
of probes to different regions of the gene may be employed. 

The presence of differences in sequence of nucleic acid 
molecules may be detected by means of restriction enzyme 

25 digestion, such as in a method of DNA fingerprinting where the 
restriction pattern produced when one or more restriction 
enzymes are used to cut a sample of nucleic acid is compared 
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with the pattern obtained when a sample containing the normal 
gene or a variant or allele is digested with the same enzyme or 
enzymes . 

The presence of absence of a lesion in a promoter or other 
regulatory sequence may also be assessed by determining the 
level of mRNA production by transcription or the level of 
polypeptide production by translation from the mRNA. 

Nucleic acid isolated and/or purified from one or more 
cells of a plant or a nucleic acid library derived from nucleic 
acid isolated and/or purified from cells (e.g. a cDNA library 
derived from mRNA isolated from the cells) , may be probed under 
conditions for selective hybridisation and/or subjected to a 
specific nucleic acid amplification reaction such as the 
polymerase chain reaction (PCR) . 

A method may include hybridisation of one or more (e". g. 
two) probes or primers to target nucleic acid. Where the 
nucleic acid is double -stranded DNA, hybridisation will 
generally be preceded by denaturation to produce single- 
stranded DNA. The hybridisation may be as part of a PCR 
procedure, or as part of a probing procedure not involving PCR. 
An example procedure would be a combination of PCR and low 
stringency hybridisation. A screening procedure, chosen from 
the many available to those skilled in the art, is used to 
identify successful hybridisation events and isolate hybridised 
nucleic acid. 

Binding of a probe to target nucleic acid (e.g. DNA) may 
be measured using any of a variety of techniques at the 
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disposal of those skilled in the art. For instance, probes may 
be radioactively , fluorescent ly or enzymatically labelled. 
Other methods not employing labelling of probe include 
examination of restriction fragment length polymorphisms, 
5 amplification using PGR, RNAase cleavage and allele specific 
oligonucleotide probing . 

Probing may employ the standard Southern blotting 
technique. For instance DNA may be extracted from cells and 
digested with different restriction enzymes. Restriction 

10 fragments may then be separated by electrophoresis on an 
agarose gel, before denaturation and transfer to a 
nitrocellulose filter. Labelled probe may be hybridised to the 
DNA fragments on the filter and binding determined. DNA for 
probing may be prepared from RNA preparations from cells. 

15 Preliminary experiments may be performed by hybridising 

under low stringency conditions various probes to Southern 
blots of DNA digested with restriction enzymes. Suitable 
conditions would be achieved when a large number of hybridising 
fragments were obtained while the background hybridisation was 

20 low. Using these conditions nucleic acid libraries, e.g. cDNA 
libraries representative of expressed sequences, may be 
searched . 

As noted, those skilled in the art are well able to employ 
suitable conditions of the desired stringency for selective 
25 hybridisation, taking into account factors such as 

oligonucleotide length and base composition, temperature and so 
on . 
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In some preferred embodiments of diagnostic assays 
according to the present invention, oligonucleotides according 
to the present invention that are fragments of any of the 
sequences shown in Figure 2, or any allele associated with 
disease resistance, e.g. as identified in Table 1, are at least 
about 10 nucleotides in length, more preferably at least about 
15 nucleotides in length, more preferably at least about 20 
nucleotides in length, more preferably about 30 nucleotides in 
length. Such fragments themselves individually represent 
aspects of the present invention. Fragments and other 
oligonucleotides may be used as primers or probes as discussed 
but may also be generated (e.g. by PCR) in methods concerned 
with determining the presence in a test sample of a sequence 
indicative of disease resistance. 

There are various methods for determining the presence or 
absence in a test sample of a particular polypeptide, such as 
the polypeptide with the amino acid sequence shown in Figure 2 , 
or other figure herein, or an amino acid sequence mutant, 
variant or allele thereof (e.g. including an alteration shown 
in Table 1) . 

A sample may be tested for the presence of a binding 
partner for a specific binding member such as an antibody (or 
mixture of antibodies) , specific for one or more particular 
variants of the polypeptide shown in Figure 2, e.g. see Table 
1 . 

In such cases, the sample may be tested by being contacted 
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with a specific binding member such as an antibody under 
appropriate conditions for specific binding, before binding is 
determined, for instance using a reporter system as discussed. 
Where a panel of antibodies is used, different reporting labels 
5 may be employed for each antibody so that binding of each can 
be det ermined . 

A specific binding member such as an antibody may be used 
to isolate and/or purify its binding partner polypeptide from a 
test sample, to allow for sequence and/or biochemical analysis 
10 of the. polypeptide to determine whether it has the sequence 

and/or properties of the wild-type polypeptide or a particular 
mutant, variant or allele thereof. Amino acid sequence is 
routine in the art using automated sequencing machines. 

15 The use of diagnostic tests for mlo alleles allows the 

researcher or plant breeder to establish, with full confidence 
and independent from time consuming resistance tests, whether 
or not a desired allele is present in the plant of interest (or 
a cell thereof) , whether the plant is a representative of a 

20 collection of other genetically identical plants (e.g. an 

inbred variety or cultivar) or one individual in a sample of 
related (e.g. breeders' selection) or unrelated plants. The 
mlo alleles conferring the desirable disease resistance 
phenotype are recessive, and are not therefore detectable at 

25 the whole plant phenotype level when in a heterozygous 
condition in the presence of a wild-type Mlo allele. 
Phenotypic screening for the presence of such recessive alleles 
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is therefore only possible on material homozygous for the mlo 
locus and so delays substantially the generation in a plant 
breeding programme at which selection can be reliably and cost- 
effectively applied. In a backcross breeding programme where, 
for example, a breeder is aiming to introgress a desirable mlo 
allele into an elite adapted high performing target genotype, 
the mlo locus will be permanently in the heterozygous condition 
until selfing is carried out. Nucleic acid or polypeptide 
testing for the presence of the recessive allele avoids the 
need to test selfed progeny of backcross generation 
individuals, thus saving considerable time and money. In other 
types of breeding scheme based on selection and selfing of 
desirable individuals, nucleic acid or polypeptide diagnostics 
for the desirable mlo alles in high throughput, low cost assays 
as provided by this invention, reliable selection for the 
desirable mlo alleles can be made at early generations and on 
more material than would otherwise be possible. This gain in 
reliability of selection plus the time saving by being able to 
test material earlier and without costly resistance phenotype 
screening is of considerable value in plant breeding. 

By way of example for nucleic acid testing, the barley 
mlo-5 resistance allele is characterized by a G- to A- 
nucleotide substitution in the predicted start codon of the Mlo 
gene (Table 1) . The mutation may easily be detected by 
standard PCR amplification of a Mlo gene segment from genomic 
template DNA with the primers: 

forward primer : 5 ' - GTTGCC ACACTTTGCC ACG - 3 ' 
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reverse primer: 5 ' -AAGCCAAGACGACAATCAGA- 3 ' 

(for example) , followed by digestion witht he restriction 
enzyme PshAl . This generates a cleaved amplified polymorphic 
sequences (CAPS) marker which may be displayed using 
5 conventional agarose gel electrophoresis. Presence of a 769 bp 
fragment is indicative of the presence of the mlo-5 allele. 

The mlo-9 resistance allele is characterized by a C- to T- 
nucleotide substitution (Table 1) . This allele is of 
particular relevance since it is used frequently in breeding 
10 material. The mutational event may be easily detected using 
the primers : 

forward primer 5 ' -GRRGCCACACTTTGCCACG- 3 ' 
reverse primer 5 ' -AAGCCAAGACGACAATCAGA-3 ' 
(for example) and subsequent digestion of genomic amplification 
15 products with the restriction enzyme Hhal . This generates a 

CAPS marker which may be displayed by conventional agarose gel 
electrophoresis. The presence of a 374 bp fragment is 
indicative of the presence of mlo-9. 

A third, particularly interesting allele is mlo-12 r 
20 characterised by a substitution a residue 240, specifically a 
Phe240 to leucine replacement. This may result from a C720 to 
A substitution in the encoding nucleotide sequence (Table 1) . 
This is the only currently documented mlo allele for which 
conclusive evidence is available that the altered protein 
25 retains residual wild-type activity (Hentrich, 1979, Arch. 

Zilchtungsvorsch. , Berlin 9, S. 283-291). mlo-12 exhibits no 
detectable spontaneous cell death reaction but confers a 
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sufficient level of resistance to pathogens such as the powdery 
mildew fungus. mlo-12 may therefore be the allele of choice in 
breeding programs if minimal pleiotropic effects (spontaneous 
cell death) are desirable after introgression of the mlo 
resistance in elite breeding lines. Furthermore, the molecular 
site of the amino acid substitution within the Mlo protein 
allows the design of alleles with a residual wild-type 
activity, and also the obtention of interacting and/or 
inhibitory molecules, reducing undesirable pleiotropic effects 
from a complete loss of function of the Mlo protein. 

Nucleic acid-based determination of the presence or 
absence of mlo alleles may be combined with determination of 
the genotype of the flanking linked genomic DNA and other 
unlinked genomic DNA using established sets of markers such as 
RFLPs , microsatellites or SSRs, AFLPs, RAPDs etc. This enables 
the researcher or plant breeder to select for not only the 
presence of the desirable mlo allele but also for individual 
plant or families of plants which have the most desirable 
combinations of linked and unlinked genetic background. Such 
recombinations of desirable material may occur only rarely 
within a given segregating breeding population or backcross 
progeny. Direct assay of the mlo locus as afforded by the 
present invention allows the researcher to make a stepwise 
approach to fixing (making homozygous) the desired combination 
of flanking markers and mlo alleles, by first identifying 
individuals fixed for one flanking marker and then identifying 
progeny fixed on the other side of the mlo locus all the time 
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knowing with confidence chat the desirable mlo allele is still 
present . 

The present disclosure provides sufficient information for 
a person skilled in the art to obtain genomic DNA sequence for 
any given new or existing mlo allele and devise a suitable 
nucleic acid- and/or polypeptide-based diagnostic assay. 
Existing mlo alleles to which this may be applied include, for 
example, mlo- 1 , mlo- 3 , mlo- A , mlo- 5 , mlo- 6 , mlo-1 , mlo- 8 , mlo- 
9, mlo-10, mlo-12, mlo- 13 f mlo- 16, mlo- 17, mlo-26 and mIo-28, 
for all of which sequence information is provided herein (see 
e.g. Figure 2 and Table 1). In designing a nucleic acid assay 
account is taken of the distinctive variation in sequence that 
characterises the particular variant allele. Thus, the present 
invention extends to an oligonucleotide fragment of a mlo 
allele, having a sequence which allows it to hybridise 
specifically to that allele as compared with other mlo alleles. 
Such an oligonucleotide spans a nucleotide at which a mlo 
mutation occurs, and may include the mutated nucleotide at or 
towards its 3' or 5' end. Such an oligonucleotide may 
hybridise with the sense or anti- sense strand. The variation 
may be within the coding sequence of the mlo gene, or may lie 
within an intron sequence or in an upstream or downstream non- 
coding sequence, wherein disruption affects or is otherwise 
related to the lesion in Mlo that results in the mildew 
resistant phenotype . 

The mlo- 9 allele is widely but not exclusively used in 
plant breeding (J Helms Jorgensen - Euphytica (1992) 63: 141- 
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152), mlo-11 is also used. Use of mlo mutants in practical 
breeding has largely been restricted to spring barley, because 
the spontaneous cell death response associated with many of the 
mutant alleles appears to represent a penalty to plant growth 
and performance when incorporated into high yielding winter 
barley genotypes. However different mlo alleles have 
different degrees of associated spontaneous cell death 
response, and thus some, either existing or newly created from 
mutagenesis programmes or isolated as spontaneous mutants, are 
more suitable than others for incorporation into winter barley 
backgrounds. The mlo- 12 allele may be particularly suitable 
since no detectable pleiotropic effects occur despite 
conferring a sufficient level of pathogen resistance. The use 
of mlo based mildew resistance more widely in winter barleys 
will have significant value for barley growers as well as 
significant economic and environmental implications such as 
reduced use of fungicide inputs with their associated treatment 
costs. The provision of nucleic acid diagnostics as provided 
herein enables rapid and accurate deployment of new and 
existing mlo alleles into winter barley germplasm. 

Plants which include a plant cell according to the 
invention are also provided, along with any part or propagule 
thereof, seed, selfed or hybrid progeny and descendants. A 
plant according to the present invention may be one which does 
not breed true in one or more properties. Plant varieties may 
be excluded, particularly registrable plant varieties according 
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co Plant Breeders' Rights. It is noted that a plant need not 
be considered a "plant variety" simply because it contains 
stably within its genome a transgene, introduced into a cell of 
the plant or an ancestor thereof. 
5 In addition to a plant, the present invention provides any 

clone of such a plant, seed, selfed or hybrid progeny and 
descendants, and any part of any of these, such as cuttings, 
seed. The invention provides any plant propagule, that is any 
part which may be used in reproduction or propagation, sexual 

10 or asexual, including cuttings, seed and so on. Also 

encompassed by the invention is a plant which is a sexually or 
asexually propagated off-spring, clone or descendant of such a 
plant, or any part or propagule of said plant, off -spring, 
clone or descendant . 

15 A further aspect of the present invention provides a 

method of making a plant cell involving introduction of the 
sequence (e.g. as part of a suitable vector) into a plant cell 
and causing or allowing recombination between the vector and 
the plant cell genome to introduce the sequence of nucleotides 

20 into the genome. 

Following transformation of a plant cell a plant may be 
regenerated. 



The invention further provides a method of modulating Mlo 
25 expression in a plant, which may modulate a defence response in 
the plant, comprising expression of a heterologous Mlo gene 
sequence (or mutant, allele, variant or homologue thereof, as 



WO 98/04586 



PCTYGB97/02046 



36 

discussed) within cells of the plant. As discussed further 
herein, modulation or alteration of the level of constitutive 
defence response in a plant may be by way of suppression, 
repression or reduction (in the manner of wild- type Mlo) or 
promotion, stimulation, activation, increase, enhancement or 
augmentation (in the manner of mutant mlo) . Activation or 
enhancement of the defence response may confer or increase 
pathogen resistance of the plant, especially resistance to 
powdery mildew and/or rust (such as yellow rust) . 

The term "heterologous" may be used to indicate that the 
gene/sequence of nucleotides in question have been introduced 
into said cells of the plant or an ancestor thereof, using 
genetic engineering, ie by human intervention. A transgenic 
plant cell, i.e. transgenic for the nucleic acid in question, 
may be provided. The transgene may be on an extra-genomic 
vector or incorporated, preferably stably, into the genome. A 
heterologous gene may replace an endogenous equivalent gene, ie 
one which normally performs the same or a similar function, or 
the inserted sequence may be additional to the endogenous gene 
or other sequence. An advantage of introduction of a 
heterologous gene is the ability to place expression of a 
sequence under the control of a promoter of choice, in order to 
be able to influence expression according to preference, such 
as under particular developmental, spatial or temporal control, 
or under control of an inducible promoter. Furthermore, 
mutants, variants and derivatives of the wild- type gene, e.g. 
with higher or lower activity than wild- type, may be used in 
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place of the endogenous gene. Nucleic acid heterologous, or 
exogenous or foreign, to a plant cell may be non-naturally 
occuring in cells of that type, variety or species. Thus, 
nucleic acid may include a coding sequence of or derived from a 
5 particular type of plant cell or species or variety of plant, 
placed within the context of a plant cell of a different type 
or species or variety of plant. A further possibility is for a 
nucleic acid sequence to be placed within a cell in which it or 
a homologue is found naturally, but wherein the nucleic acid 

10 sequence is linked and/or adjacent to nucleic acid which does 
not occur naturally within the cell, or cells of that type or 
species or variety of plant, such as operably linked to one or 
more regulatory sequences, such as a promoter sequence, for 
control of expression. A sequence within a plant or other host 

15 cell may be identifiably heterologous, exogenous or foreign. 

Down-regulation of wild-type Mlo gene function leads to 
stimulation of a constitutive defence response. This may be 
achieved in a number of different ways, as illustrated below. 

The nucleic acid according to the invention may be placed 

20 under the control of an inducible gene promoter thus placing 
expression under the control of the user. 

In a further aspect the present invention provides a gene 
construct comprising an inducible promoter operatively linked 
to a nucleotide sequence provided by the present invention. As 

25 discussed, this enables control of expression of the gene. The 
invention also provides plants transformed with said gene 
construct and methods comprising introduction of such a 
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construct into a 'plant cell and/or induction of expression of a 
construct within a plant cell, e.g. by application of a 
suitable stimulus, such as an effective exogenous inducer or 
endogenous signal . 

The term "inducible" as applied to a promoter is well 
understood by those skilled in the art. In essence, expression 
under the control of an inducible promoter is "switched on" or 
increased in response to an applied stimulus (which may be 
generated within a cell or provided exogenously) . The nature of 
the stimulus varies between promoters. Some inducible promoters 
cause little or undetectable levels of expression (or no 
expression) in the absence of the appropriate stimulus. Other 
inducible promoters cause detectable constitutive expression in 
the -absence of the stimulus. Whatever the level of expression 
is in the absence of the stimulus, expression from any 
inducible promoter is increased in the presence of the correct 
stimulus. The preferable situation is where the level of 
expression increases upon application of the relevant stimulus 
by an amount effective to alter a phenotypic characteristic. 
Thus an inducible (or "switchable" ) promoter may be used which 
causes a basic level of expression in the absence of the 
stimulus which level is too low to bring about a desired 
phenotype (and may in fact be zero) . Upon application of the 
stimulus, expression is increased (or switched on) to a level 
which brings about the desired phenotype. 

Suitable promoters include the Cauliflower Mosaic Virus 
35S (CaMV 35S) gene promoter that is expressed at a high level 
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in virtually all plant tissues (Benfey et al, (1990a) EMBO J 9: 
1677-1684); the cauliflower meri 5 promoter that is expressed 
in the vegetative apical meristem as well as several well 
localised positions in the plant body, eg inner phloem, flower 
5 primordia, branching points in root and shoot (Medford, J.I. 
(1992) Plant Cell 4, 1029-1039; Medford et al, (1991) Plant 
Cell 3, 359-370) and the Arabidopsis thaliana LEAFY promoter 
that is expressed very early in flower development (Weigel et 
al, (1992) Cell 69, 843-859) . 

10 An aspect of the present invention is the use of nucleic 

acid according to the invention in the production of a 
transgenic plant. 

When introducing a chosen gene construct into a cell, 
certain considerations must be taken into account, well known 

15 to those skilled in the art. The nucleic acid to be inserted 

should be assembled within a construct which contains effective 
regulatory elements which will drive transcription. There must 
be available a method of transporting the construct into the 
cell. Once the construct is within the cell membrane, 

20 integration into the endogenous chromosomal material either 

will or will not occur. Finally, as far as plants are concerned 
the target cell type must be such that cells can be regenerated 
into whole plants. 

Plants transformed with the DNA segment containing the 

25 sequence may be produced by standard techniques which are 

already known for the genetic manipulation of plants. DNA can 
be transformed into plant cells using any suitable technology, 
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such as a disarmed Ti-plasmid vector carried by Agrobacterium 
exploiting its natural gene transfer ability (EP-A-270355 , EP- 
A-0116718, NAR 12(22) 8711 - 87215 1984), particle or 
micropro jectile bombardment (US 5100792, EP-A-444882, EP-A- 
434616) microinjection (WO 92/09696, WO 94/00583, EP 331083, EP 
175966, Green et al . (1987) Plant Tissue and Cell Culture, 
Academic Press), electroporation (EP 290395, WO 8706614) other 
forms of direct DNA uptake (DE 4005152, WO 9012096, US 
4684611), liposome mediated DNA uptake (e.g. Freeman et al . 
Plant Cell Physiol. 29: 1353 (1984)), or the vortexing method 
(e.g. Kindle, PNAS U.S.A. 87: 1228 (1990d) Physical methods for 
the transformation of plant cells are reviewed in Oard, 1991, 
Biotech. Adv. 9: 1-11. 

Agrobacterium transformation is widely used by those 
skilled in the art to transform dicotyledonous species. 
Recently, there has been substantial progress towards the 
routine production of stable, fertile transgenic plants in 
almost all economically relevant monocot plants (Toriyama; et 
al. (1988) Bio/Technology 6, 1072-1074; Zhang, et al . (1988) 
Plant Cell Rep. 7, 379-384; Zhang, et al . (1988) Theor Appl 
Genet 76, 835-840; Shimamoto, et al . (1989) Nature 338, 274- 
276; Datta, et al . (1990) Bio/Technology 8, 736-740; Christou, 
et al. (1991) Bio/Technology 9, 957-962; Peng, et al . (1991) 
International Rice Research Institute, Manila, Philippines 563- 
574; Cao, et al . (1992) Plant Cell Rep. 11, 585-591; Li, et al . 
(1993) Plant Cell Rep. 12, 250-255; Rathore, et al . (1993) 
Plant Molecular Biology 21, 871-884; Fromm, et al . (1990) 
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Bio/Technology 8, 833-839; Gordon-Kamm, et al . (1990) Plant 
Cell 2, 603-618; D'Halluin, et al . (1992) Plant Cell 4, 1495- 
1505; Walters, et al . (1992) Plant Molecular Biology 18, 189- 
200 ; Koziel, et al . (1993) Bi o t echnology 11, 194-200; Vasil, I. 
5 K. (1994) Plant Molecular Biology 25, 925-937; Weeks, et al . 
(1993) Plant Physiology 102, 1077-1084; Somers, et al . (1992) 
Bio/Technology 10, 1589-1594; W092/14828). In particular, 
Agrobacterium mediated transformation is now emerging also as 
an highly efficient alternative transformation method in 

10 monocots (Hiei et al . (1994) The Plant Journal 6, 271-282). 

The generation of fertile transgenic plants has been 
achieved in the cereals rice, maize, wheat, oat, and barley 
(reviewed in Shimamoto, K. (1994) Current Opinion in 
Biotechnology 5, 158-162.; Vasil, et al . (1992) Bio/Technology 

15 10, 667-674; Vain et al . , 1995, Biotechnology Advances 13 (4): 
653-671; Vasil, 1996, Nature Biotechnology 14 page 702) . 

Microproj ectile bombardment, electroporation and direct 
DNA uptake are preferred where Agrobacterium is inefficient or 
ineffective. Alternatively, a combination of different 

20 techniques may be employed to enhance the efficiency of the 
transformation process, eg bombardment with Agrobacterium 
coated microparticles (EP-A-486234 ) or microproj ectile 
bombardment to induce wounding followed by co-cultivation with 
Agrobacterium (EP-A-486233 ) . 

25 Following transformation, a plant may be regenerated, e.g. 

from single cells, callus tissue or leaf discs, as is standard 
in the art. Almost any plant can be entirely regenerated from 
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cells, tissues and organs of the plant. Available techniques 
are reviewd in Vasil et al . , Ce21 Culture and Somatic Cel 
Genetics of Plants, Vol I, II and III, Laboratory Procedures 
and Their Applications , Academic Press, 1984, and Weissbach and 
Weissbach, Methods for Plant Molecular Biology, Academic Press, 
1989 . 

The particular choice of a transformation technology will 
be determined by its efficiency to transform certain plant 
species as well as the experience and preference of the person 
practising the invention with a particular methodology of 
choice. It will be apparent to the skilled person that the 
particular choice of a transformation system to introduce 
nucleic acid into plant cells is not essential to or a 
limitation of the invention, nor is the choice of technique for 
plant regeneration . 

In the present invention, expression may be achieved by 
introduction of the nucleotide sequence in a sense orientation. 
Thus, the present invention provides a method of modulation of 
a defence response in a plant, the method comprising causing or 
allowing expression of nucleic acid according to the invention 
within cells of the plant. Generally, it will be desirable to 

stimulate the defence response, and this may be achieved by 

disrupting Mlo gene function. 

Down-regulation of expression of a target gene may be 

achieved using anti-sense technology or "sense regulation" 

("co-suppression") . 

In using anti-sense genes or partial gene sequences to 



PCT/GB97/02046 

43 

down-regulate gene expression, a nucleotide sequence is placed 
under the control of a promoter in a "reverse orientation" such 
that transcription yields RNA which is complementary to normal 
mRNA transcribed from the "sense 11 strand of the target gene. 
5 See, for example, Rothstein et al, 1987; Smith et al, (1988) 

Nature 334, 724-726; Zhang et al, (1992) The Plant Cell 4, 1575- 
1588, English et al . , (1996) The Plant Cell 8, 179-188. 
Antisense technology is also reviewed in Bourque , (1995), Plant 
Science 105, 125-149, and Flavell, (1994) PNAS USA 91, 3490- 
10 3496. 

An alternative is to use a copy of all or part of the 
target gene inserted in sense, that is the same, orientation as 
the target gene, to achieve reduction in expression of the 
target gene by co-suppression. See, for example, van der Krol 

15 et al., (1990) The Plant Cell 2, 291-299; Napoli et al . , (1990) 
The Plant Cell 2, 279-289; Zhang et al . , (1992) The Plant Cell 
4, 1575-1588, and US-A- 5 , 23 1 , 020 . 

The complete sequence corresponding to the coding sequence 
(in reverse orientation for anti-sense) need not be used. For 

2 0 example fragments of sufficient length may be used. It is a 
routine matter for the person skilled in the art to screen 
fragments of various sizes and from various parts of the coding 
sequence to optimise the level of anti-sense inhibition. It 
may be advantageous to include the initiating methionine ATG 

25 codon, and perhaps one or more nucleotides upstream of the 
initiating codon. A further possibility is to target a 
conserved sequence of a gene, e.g. a sequence that is 
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characteristic of one or more genes, such as a regulatory 
sequence. Antisense constructs may involve 3 ' end or 5 ' end 
sequences of Mlo or homologues. In cases where several Mlo 
homologues exist in a plant species, the involvement of' 5'- and 
3 '-end untranslated sequences in the construct will enhance 
specificity of silencing. 

The sequence employed may be about 500 nucleotides or 
less, possibly about 400 nucleotides, about 300 nucleotides, 
about 200 nucleotides, or about 100 nucleotides. It may be 
possible to use oligonucleotides of much shorter lengths, 14-23 
nucleotides, although longer fragments, and generally even 
longer than about 500 nucleotides are preferable where 
possible, such as longer than about 600 nucleotides, than about 
700 nucleotides, than about 800 nucleotides, than about 1000 
nucleotides, than about 1200 nucleotides, than about 1400 
nucleotides, or more. 

It may be preferable that there is complete sequence 
identity in the sequence used for down- regulation of expression 
of a target sequence, and the target sequence, though total 
complementarity or similarity of sequence is not essential. 
One or more nucleotides may differ in the sequence used from 
the target gene. Thus, a sequence employed in a down- 
regulation of gene expression in accordance with the present 
invention may be a wild- type sequence (e.g. gene) selected from 
those available, or a mutant, derivative, variant or allele, by 
way of insertion, addition, deletion or substitution of one or 
more nucleotides, of such a sequence. The sequence need not 



PCT/GB97/02046 



include an open reading frame or specify an RNA that would be 
translatable. It may be preferred for there to be sufficient 
homology for the respective anti -sense and sense RNA molecules 
to hybridise. There may be down regulation of gene expression 
5 even where there is about 5%, 10%, 15% or 20% or more mismatch 
between the sequence used and the target gene. 

Generally, the transcribed nucleic acid may represent a 
fragment of an Mlo gene, such as including a nucleotide 
sequence shown in Figure 2, or the complement thereof, or may 

10 be a mutant, derivative, variant or allele thereof, in similar 
terms as discussed above in relation to alterations being made 
to a coding sequence and the homology of the altered sequence. 
The homology may be sufficient for the transcribed anti-sense 
RNA to hybridise with nucleic acid within cells of the plant, 

15 though irrespective of whether hybridisation takes place the 
desired effect is down-regulation of gene expression. 

Anti-sense regulation may itself be regulated by employing 
an inducible promoter in an appropriate construct. 

Constructs may be expressed using the natural promoter, by 

20 a constitutively expressed promotor such as the CaMV 35S 

promotor, by a tissue - specif ic or cell-type specific promoter, 
or by a promoter that can be activated by an external signal or 
agent. The CaMV 35S promoter but also the rice actinl and 
maize ubiquitin promoters have been shown to give high levels 

25 of reporter gene expression in rice (Fujimoto et al . , (1993) 

Bio/Technology 11, 1151-1155; Zhang, et al . , (1991) Plant Cell 
3, 1155-1165; Cornejo et al . , (1993) Plant Molecular Biology 
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23 , 567-581) , 

For use in anti-sense regulation, nucleic acid including a 
nucleotide sequence complementary to a coding sequence of a Mlo 
gene (i.e. including homologues) , or a fragment of a said 
5 coding sequence suitable for use in anti -sense regulation of 
expression, is provided. This may be DNA and under control of 
an appropriate regulatory sequence for anti -sense transcription 
in cells of interest. 

Thus, the present invention also provides a method of 
10 conferring pathogen resistance on a plant, the method including 
causing or allowing anti-sense transcription from heterologous 
nucleic acid according to the invention within cells of the 
plant . 

The present invention further provides the use of the 
15 nucleotide sequence of Figure 2 or a fragment, mutant, 

derivative, allele, variant or homologue thereof, such as any 
sequence shown or identified herein, for down -regulation of 
gene expression, particularly down- regulation of expression of 
an Mlo gene or homologue thereof, preferably in order to confer 
20 pathogen resistance on a plant. 

When additional copies of the target gene are inserted in 
sense, that is the same, orientation as the target gene, a 
range of phenotypes is produced which includes individuals 
where over-expression occurs and some where under-expression of 
25 protein from the target gene occurs. When the inserted gene is 
only part of the endogenous gene the number of under-expressing 
individuals in the transgenic population increases. The 



WO ?2/e i 5Sf PCT/GB97/02046 

4 7 

mechanism by which sense regulation occurs, particularly 
down -regulation, is not well-understood. However, this 
technique is well-reported in scientific and patent literature 
and is used routinely for gene control. See, for example, van 
der Krol et al . , (1990) The Plant Cell 2, 291-229; Napoli et 
al., (1990) The Plant Cell 2, 279-289; Zhang et al , 1992 The 
Plant Cell 4, 1575-1588. 

Again, fragments, mutants and so on may be used in similar 
terms as described above for use in anti-sense regulation. 

Thus, the present invention also provides a method of 
conferring pathogen resistance on a plant, the method including 
causing or allowing expression from nucleic acid according to 
the invention within cells of the plant. This may be used to 
suppress Mlo activity. Here the activity of the product is 
preferably suppressed as a result of under-expression within 
the plant cells. 

As noted, Mlo down-regulation may promote activation of a 
defence response, which may in turn confer or augment pathogen 
resistance of the plant, especially resistance to powdery 
mildew and/or rust (e.g. yellow rust) . 

Thus, the present invention also provides a method of 
modulating Mlo function in a plant, the method comprising 
causing or allowing expression from nucleic acid according to 
the invention within cells of the plant to suppress endogenous 
Mlo expression. 

Modified versions of Mlo may be used to down-regulate 
endogenous Mlo function. For example mutants, variants, 
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derivatives etc., may be employed. For instance, expression of 
a mlo mutant sequence at a high level may out -compete activity 
of endogenous Mlo. 

Reduction of Mlo wild type activity may be achieved by 
5 using ribozymes, such as replication ribozymes, e.g. of the 

hammerhead class (Haseloff and Gerlach, 1988, Nature 334: 585- 
591; Feyter et al . Mol . , 1996, Gen. Genet. 250: 329-338). 

Another way to reduce Mlo function in a plant employs 
transposon mutagenesis (reviewed by Osborne et al . , (1995) 

10 Current Opinion in Cell Biology 7, 406-413). Inactivation of 
genes has been demonstrated via a 1 targeted tagging' approach 
using either endogenous mobile elements or heterologous cloned 
transposons which retain their mobility in alien genomes. Mlo 
alleles carrying any insertion of known sequence could be 

15 identified by using PGR primers with binding specificities both 
in the insertion sequence and the Mlo homologue . 'Two-element 
systems' could be used to stabilize the transposon within 
inactivated alleles. In the two-element approach, a T-DNA is 
constructed bearing a non- autonomous transposon containing 

20 selectable or screenable marker gene inserted into an excision 
marker. Plants bearing these T-DNAs are crossed to plants 
bearing a second T-DNA expressing transposase function. Hybrids 
are double-selected for excision and for the marker within the 
transposon yielding F 2 plants with transposed elements. The 

25 two-element approach has a particular advantage with respect to 
Ac/Ds of maize, as the transposed Ds is likely to be unlinked 
to the transposase, facilitating outcrossing and stabilization 
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of the Ds insertion (Jones et al . , (1994) Science 266, 789-793; 
Osborne et al . , (1995) Current Opinion in Cell Biology 7, 406- 
413) . 

5 The mlo-based powdery mildew resistance is caused by the 

inactivation of the Mlo wild type allele, resulting in a 
recessive resistance phenotype . Substances that inhibit the 
activity of the Mlo wild type protein may be used to induce the 
resistance phenotype. 

10 An important hint that complete inactivation of Mlo 

expression is not essential and may even be detrimental is 
provided by the description of mutagen- induced mlo resistance 
alleles that are likely to have retained residual wild type 
allele activity. These alleles exhibit no detectable 

IB spontaneous leaf necrosis which negatively affects 

photosynthesis rates and yield (Hentrich, W (1979) Arch. 
Z-lch tungsvorsch. , Berlin 9, S. 283-291). 

The Mlo protein is predicted to be membrane -anchored by 
seven transmembrane helices (see e.g. Figure 7). This 

20 structure prediction has been reinforced by recent analysis of 
Mlo homologues in rice and Arabidopsis thaliana . Structure 
prediction of the Arahidopsis thaliana homologue also suggests 
the presence of seven transmembrane helices. A comparison of 
the Mlo homologues revealed in addition conserved cysteine 

25 residues in the putative extracellular loops 1 and 3 and high 
probabilities of amphipathic helices in the second 
intracellular loop adjacent to the predicted transmembrane 
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helices 3 and 4. These conserved structural motifs in the 
family of Mlo proteins are reminiscent of G protein coupled 
receptors (GPCR) described extensively in mammalian systems. 
GPCRs are known to be activated by ligands and to amplify 
5 signals intracellularly via heterotrimeric G proteins. Without 
in any way providing a limitation on the nature or scope of any 
aspect of the present invention, it is predicted that Mlo 
activates an inhibitory G alpha subunit of heterotrimeric G 
proteins, thus leading to a downregulat ion of as yet unknown 

10 effector proteins. 

The provision herein of Mlo sequence information enables 
the identification of antagonists of function of the Mlo 
protein (e.g. GPCR function). Antagonists of Mlo may block 
receptor activation by its unknown genuine ligand, mimicking 

15 recessive mutations in the Mlo gene. Such Mlo antagonists may 
be used as crop protection compounds, for example applied 
externally to the plant or crop or, where the compound is 
peptidyl in nature, delivered internally via a biological 
vector (e.g, recombinant infecting viral particle expressing 

20 the antagonistic molecule within target plant cells) or via a 

transgenic route (plants or plant cells genetically modified to 
express the antagonist molecule, perhaps under control of a 
promoter inducible by an externally applied compound (eg GST- I I 
promoter from maize - Jepson et al Plant Molecular Biology 

25 26:1855-1866 (1994)) allowing control over the timing of 
expresion of the mlo inactivation phenotype . 

Leaf segments of Mlo wild type plants may be tested with a 



PCT/GB97/02046 



51 

test substance, e.g. from a random or combinatorial compound 
library, for resistance upon challenge with pathogen such as 
powdery mildew. The detached leaf segment assay is used as a 
standard test system to score for susceptibility/resistance 
5 upon inoculation with powdery mildew spores. Leaf segments of 
7 -day-old seedlings of the genotype Mlo Rorl may be placed on 
agar, for example individual wells of 96-well microtiter plates 
containing 50/zl agar. Different compounds may be applied to 
the agar surface in each well at a concentration of about Ippm 

10 dissolved in DMSO. Around seven days after inoculation of the 
detached leaf segments with pathogen, such as spores of a 
virulent powdery mildew isolate, compounds which induce 
resistance may be recognised by the absence of fungal mycelium 
on leaf segments in the microtiter plates. 

15 A further selection may be used to discriminate between 

compounds that act in the mlo pathway and those that confer 
resistance by other mechanisms, or those which exhibit a direct 
fungitoxic activity. For this purpose mutants in genes {Ror 
genes) which may be required for mlo resistance ( Freialdenhoven 

20 et al., (1996), The Plant Cell 8, 5-14) may be used. Mutants 
of these genes confer susceptibility to powdery mildew attack 
despite the presence of mlo resistance alleles. Plants of the 
genotype Mlo rorl (wild type Mlo protein and defective Rorl 
gene) may be used, for example, to test compounds which induce 

25 resistance on Mlo Rorl genotypes but exhibit susceptibility on 
the Mlo rorl genotype, enabling selection of candidate Mlo 
antagonists. Testing candidate compounds identified using a 
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leaf segment test may be used to drastically reduce the number 
of candidate compounds for further in vitro tests. 

A further selection step of candidate antagonists may 
involve heterologous expression of the Mlo protein or a 
fragment thereof (e.g. in a baculovirus insect cell system) and 
subsequent binding assays with labelled molecules. Specific 
binding of compounds to cell lines expressing wild type Mlo 
protein is a good indicator of their antagonistic mode of 
action. Analsis of the deduced Mlo protein sequence has 
provided strong evidence that the protein is anchored in the . 
membrane via seven transmembrane helices and may represent a 
novel member of the so-called serpentine receptor family. The 
conclusion is supported by the sequence data derived from 
homologous genes identified in barley, rice and Arabidopsis . 
Seven transmembrane proteins have been shown to be expressed at 
high level in the Baculovirus / insect cell system (up to 10 7 
molecules per cell - Tate and Grisshamer, 1996, TIBTECH 14: 
426-430) . Since the family of Mlo proteins appears to be 
restricted to the plant kingdom, this provides a low-background 
environment for compound tests. Candidate compounds which are 
labelled, radioactively or non-radioactively, may be tested for 
specific binding to Sf9 insect cells expressing the Mlo protein 
after infecion with a recombinant baculovirus construct. 
Specificity of the binding may be tested further by Sf9 
expression of mutant mlo proteins which carry characterised 
mutations (e.g. as in Table 1) leading in vivo to resistance. 



wn or /ft j ^e/; 



PCT/GB97/02046 



53 

Thus, in various further aspects the present invention 
relates to assays for substances able to interfere with Mlo 
function, i.e. confer a mlo mutant phenotype, such substances 
themselves and uses thereof. 
5 The use of Mlo in identifying and/or obtaining a substance 

which inhibits Mlo function is further provided by the present 
invention, as is the use of Mlo in identifying and/or obtaining 
a substance which induces pathogen resistance in a plant. 

10 Agents useful in accordance with the present invention may 

be identified by screening techniques which involve determining 
whether an agent under test inhibits or disrupts Mlo function 
to induce an mlo phenotype. Candidate inhibitors are 
substances which bind Mlo. 

15 It should of course be noted that references to "Mlo" in 

relation to assays and screens should be taken to refer to 
homologues, such as in other species, including rice and wheat, 
not just in barley, also appropriate fragments, variants, 
alleles and derivatives thereof. Assessment of whether a test 

20 substance is able to bind the Mlo protein does not necessarily 
require the use of full-length Mlo protein. A suitable 
fragment may be used (or a suitable analogue or variant 
thereof) . 

Suitable fragments of Mlo include those which include 
25 residues known to be crucial for Mlo function as identified by 
mlo mutant alleles (Table 1). Smaller fragments, and analogues 
and variants of this fragment may similarly be employed, e.g. 
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as identified using techniques such as deletion analysis or 
alanine scanning. 

Furthermore, one class of agents that can be used to 
disrupt Mlo activity are peptides fragments of it. Such 
peptides tend to be short, and may be about 4 0 amino acids in 
length or less, preferably about 35 amino acids in length or 
less, more preferably about 30 amino acids in length, or less, 
more preferably about 25 amino acids or less, more preferably 
about 20 amino acids or less, more preferably about 15 amino 
acids or less, more preferably about 10 amino acids or less, or 
9, 8, 7, 6, 5 or less in length. The present invention also 
encompasses peptides which are sequence variants or derivatives 
of a wild type Mlo sequence, but which retain ability to 
interfere with Mlo function, e.g. to induce an mlo mutant 
phenotype. Where one or more additional amino acids are 
included, such amino acids may be from Mlo or may be 
heterologous or foreign to Mlo. A peptide may also be included 
within a larger fusion protein, particularly where the peptide 
is fused to a non-Mlo(i.e. heterologous or foreign) sequence, 
such as a polypeptide or protein domain. 

Peptides may be generated wholly or partly by chemical 
synthesis. The compounds of the present invention can be 
readily prepared according to well-established, standard liquid 
or, preferably, solid-phase peptide synthesis methods, general 
descriptions of which are broadly available (see, for example, 
in J.M. Stewart and J.D. Young, Solid Phase Peptide Synthesis, 
2nd edition, Pierce Chemical Company, Rockford, Illinois 
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.1984), in M. Bodanzsky and A. Bodanzsky, The Practice of 
Peptide Synthesis, Springer Verlag, New York (1984); and 
Applied Biosystems 430A Users Manual, ABI Inc., Foster City, 
California) , or they may be prepared in solution, by the liquid 
5 phase method or by any combination of solid-phase, liquid phase 
and solution chemistry, e.g. by first completing the respective 
peptide portion and then, if desired and appropriate, after 
removal of any protecting groups being present, by introduction 
of the residue X by reaction of the respective carbonic or 

10 sulfonic acid or a reactive derivative thereof. 

Another convenient way of producing a peptidyl molecule 
according to the present invention (peptide or polypeptide) is 
;o express nucleic acid encoding it, by use of nucleic acid in 
an expression system, as discussed elsewhere herein. This 

15 allows for peptide agents to be delivered to plants 

rransgenically, by means of encoding nucleic acid. If coupled 
:o an inducible promoter for expression under control of the 
user, this allows for flexibility in induction of an mlo 
phenotype and pathogen resistance. This may allow for any 

20 side-effects arising from interference with Mlo function to be 
moderated. 



In one general aspect the present invention provides an 
assay method for a substance able to interact with the relevant 
25 region of Mlo, the method including: 

(a) bringing into contact a Mlo polypeptide or peptide 
fragment thereoof, or a variant, derivative or analogue 
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thereof, and a test compound; and 

(b) determining interaction or binding between said 
polypeptide or peptide and the test compound. 

A test compound found to interact with the relevant 
5 portion of Mlo may be tested for ability to modulate, e.g. 

disrupt or interfere with, Mlo function, as discussed already 
above . 

Another general aspect of the present invention provides 
10 an assay method for a substance able to induce an mlo mutant 
phenotype in a plant, the method including: 

(a) bringing into contact a plant or part thereof (e.g. 
leaf or leaf segment) and a test compound; and 

(b) determining Mlo function and/or pathogen resistance 
15 and/or stimulation of a defence response in the plant. 

Susceptibility or resistance to a pathogen may be 
determined by assessing pathogen growth, e.g. for powdery 
mildew the presence or absence, or extent, of mycelial growth. 

Binding of a test compound to a polypeptide or peptide may 
20 be assessed in addition to ability of the test compound to 

stimulate a defence response in a plant. Such tests may be run 
in parallel or one test may be performed on a substance which 
tests positive in another test. 

25 Of course, the person skilled in the art will design any 

appropriate control experiments with which to compare results 
obtained in test assays. 
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Performance of an assay method according to the present 
invention may be followed by isolation and/or manufacture 
and/or use of a compound, substance or molecule which tests 
positive for ability to modulate Mlo function and/or induce 
pathogen resistance, such as resistance to powdery mildew. 

The precise format of an assay of the invention may be 
varied by those of skill in the art using routine skill and 
knowledge. For example, interaction between substances may be 
studied in vitro by labelling one with a detectable label and 
bringing it into contact with the other which has been 
immobilised on a solid support. Suitable detectable labels, 
especially for peptidyl substances include 35 S-methionine which 
may- be incorporated into recombinant ly produced peptides and 
polypeptides. Recombinantly produced peptides and polypeptides 
may also be expressed as a fusion protein containing an epitope 
which can be labelled with an antibody. 

An assay according to the present invention may also take 
the form of an in vivo assay. The in vivo assay may be 
performed in a cell line such as a yeast strain or mammalian 
cell line in which the relevant polypeptides or peptides are 
expressed from one or more vectors introduced into the cell. 

For example, a polypeptide or peptide containing a 
fragment of Mlo or a peptidyl analogue or variant thereof as 
disclosed, may be fused to a DNA binding domain such as that of 
the yeast transcription factor GAL 4. The GAL 4 transcription 
factor includes two functional domains. These domains are the 
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DNA binding domain (GAL4DBD) and the GAL4 transcriptional 
activation domain (GAL4TAD) . By fusing such a polypeptide or 
peptide to one of those domains and another polypeptide or 
peptide to the respective counterpart, a functional GAL 4 
5 transcription factor is restored only when two polypeptides or 
peptides of interest interact. Thus, interaction of the 
polypeptides or peptides may be measured by the use of a 
reporter gene probably linked to a GAL 4 DNA binding site which 
is capable of activating transcription of said reporter gene. 

10 This assay format is described by Fields and Song, 1989, Nature 
340 ; 245-246. This type of assay format can be used in both 
mammalian cells and in yeast. Other combinations of DNA 
binding domain and transcriptional activation domain are 
available in the art and may be preferred, such as the LexA DNA 

15 binding domain and the VP60 transcriptional activation domain. 
When looking for peptides or other substances which 
interact with Mlo, the Mlo polypeptide or peptide may be 
employed as a fusion with (e.g.) the LexA DNA binding domain, 
with test polypeptide or peptide (e.g. a random or 

20 combinatorial peptide library) as a fusion with (e.g.) VP60. 

An increase in reporter gene expression (e.g. in the case of /3- 
galactosidase a strengthening of the blue colour) results from 
the presence of a peptide which interacts with Mlo, which 
interaction is required for transcriptional activation of the 

25 0-galactosidase gene . 



The amount of test substance or compound which may be 
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added to an assay of the invention will normally be determined 
by trial and error depending upon the type of compound used. 
Typically, from about 0.001 nM to ImM or more concentrations of 
putative inhibitor compound may be used, for example from 0.01 
5 nM to IOOjxM, e.g. 0.1 to 50 /*M, such as about 10 fiM. Greater 
concentrations may be used when a peptide is the test 
substance. Even a molecule which has a weak effect may be a 
useful lead compound for further investigation and development. 
Compounds which may be used may be natural or synthetic 

10 chemical compounds used in drug screening programmes. Extracts 
of plants which contain several characterised or 
uncharacterised components may also be used. Antibodies 
directed to Mlo or a fragment thereof form a further class of 
putative inhibitor compounds. Candidate inhibitor antibodies 

15 may be characterised and their binding regions determined to 

provide single chain antibodies and fragments thereof which are 
responsible for disrupting the interaction. Other candidate 
inhibitor compounds may be based on modelling the 3 -dimensional 
structure of a polypeptide or peptide fragment and using 

20 rational drug design to provide potential inhibitor compounds 
with particular molecular shape, size and charge 
characteristics. It is worth noting, however, that 
combinatorial library technology provides an efficient way of 
testing a potentially vast number of different substances for 

25 ability to interact with and/or modulate the activity of a 
polypeptide. Such libraries and their use are known in the 
art, for all manner of natural products, small molecules and 
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peptides, among others. The use of peptide libraries may be 
preferred in certain circumstances. 

Following identification of a substance or agent which 
modulates or affects Mlo function, the substance or agent may 
be investigated further. Furthermore, it may be manufactured 
and/or used in preparation, i.e. manufacture or formulation, of 
a composition for inducing pathogen resistance in a plant. 
These may be applied to plants, e.g. for inducing pathogen 
resistance, such as resistance to powedery mildew. A further 
aspect of the present invention provides a method of inducing 
pathogen resistance in a plant, the method including applying 
such a substance to the plant. A peptidyl molecule may be 
applied to a plant transgenically , by expression from encoding 
nucleic acid, as noted. 

A polypeptide, peptide or other substance able to modulate 
or interfere with Mlo function, inducing pathogen resistance in 
a plant as disclosed herein, or a nucleic acid molecule 
encoding a peptidyl such molecule, may be provided in a kit, 
e.g. sealed in a suitable container which protects its contents 
from the external environment. Such a kit may include 
instructions for use. 

Further aspects and embodiments of the present invention 
will be apparent to those skilled in the art. The present 
invention will now be exemplified by way of illustration with 
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reference to the following figures: 



Figure 1 Positional Cloning of Mlo. The Mlo locus has 
been mapped with increasing precision on the long arm of barley 
5 chromosome 4 using morphological, RFLP and AFLP markers. The 
upper part of the figure presents the genetic linkage maps of 
these markers relative to Mlo. All genetic distances are 
indicated in centiMorgan (cM) based on multi-point linkage 
analysis except for genetic distances between AFLP markers 

10^ which are calculated by two-point-estimates. The morphological 
marker map (Jorgensen, 1977) positions Mlo at a distance of 
more than 2 0 cM to hairy leaf sheath (tfs) and glossy 
sheath/spike (gsl) . The RFLP marker map is based on the 
analysis of 2 57 F 2 individuals derived from the cross Carlsberg 

15 II Mlo Grannenlose Zweizeilige mlo-11. The previously 

published RFLP map (Hinze et al . , 1991) of the same cross was 
based on only 44 F 2 individuals. The gene was delimited to a 
2.7 cM interval bordered by markers bAOll and bAL88. AFLP 
markers were identified and mapped as described in Experimental 

20 procedures. Their genetic distance to Mlo is based on the 

cross Ingrid Mlo x BC 7 Ingrid mlo-3. The crucial result of the 
AFLP analysis- has been the identification of two markers, Bpm2 
and Bpm9, defining an 0.64 cM interval containing the Mlo locus 
and one marker (Bpml6) cosegregating with Mlo on the basis of 

25 more than 4,000 meiotic events. Marker Bxm2 which is located 
0.1 cM telomeric to Mlo was derived from BAC F15 template DNA 
(see below) . One YAC clone, YAC YHV303-A6, containing the 
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cosegregating marker BpmlG and two flanking loci (Bpm2 and 
Bpm9) , is shown in the middle section of the figure. The 
position of marker Bpm9 was only roughly estimated within the 
YAC clone as indicated by the arrow. The insert of BAC F15 
represents a 60 kb subfragment of this YAC as indicated in the 
lower part of the Figure. After the identification of AFLP 
marker Bpm2 in BAC F15, marker Bxm2 was discovered and 
positioned 0.1 cM in telomeric orientation of Mlo. The 
approximate physical position of AFLP markers Bpm2, Bpml6, and 
Bxm2 (spanning an interval of approximately 3 0 kb) as well as 
the location of some rare occurring restriction sites are 
indicated. Dashed lines below the schematic representation of 
BAC F15 DNA show the position of the largest established DNA 
sequence contigs. The structure of the Mlo gene is given 
schematically in the bottom line of the Figure. Exons are 
highlighted by black boxes. Positions of mutational events are 
indicated for the eleven tested mlo alleles. Mutant alleles 
carrying deletions in their nucleotide sequence are marked with 
a a; the remaining mutant alleles represent single nucleotide 
substitutions resulting in amino acid exchanges in each case . 

Figure 2 shows an Mlo coding sequence and encoded amino 
acid sequence according to the present invention. The amino 
acid sequence predicted from DNA sequences of RT-PCR products 
from Ingrid Mlo are shown. Nucleotide numbers are given 
according to translational start site. 

Figure 3 Northern Blot Analysis of Mlo Transcript 
Accumulation. Total RNA (20 /xg) and poly (A) + RNA (5 /xg) of 
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seven-day-old uninfected barley primary leaves of one wild type 
(cultivar Ingrid Mlo) and two mutant (BC Ingrid mlo-1, BC 
Ingrid mlo-3) cultivars were isolated, separated on a 1.2% 
formaldehyde gel and transferred to a nitrocellulose membrane 
5 (Hybond) . The filter was probed under stringent conditions 
(Sambrook et al . , 1989) with the radioactivity labelled full 
size RT-PCR product derived from Ingrid AUo (Figure 7) . A 
clear signal is detected only in the lanes containing poly (A) + 
RNA. The signal corresponds to a size of approximately 2 kb. 

10 Figure 4 Southern Blot Analysis of Intragenic Recombinants 

derived from mlo heteroallelic crosses. The alleles of two 
RFLP markers flanking Mlo on opposite sides of either 
susceptible F 2 individuals or homozygous susceptible and 
homozygous resistant progeny were determined by Southern blot 

15 analysis. Plant DNA (10 fig) of the individuals were digested 
with Pst I (A) or Hae III (B) and hybridized with the 
radioactively labelled RFLP markers WG114 (upper panel; maps 
3.1 cM in centromeric orientation to Mlo; see Figure 1) and 
ABG3 6 6 (lower panel; maps 0.7 cM in telomeric orientation to 

20 Mlo; see Figure 1) according to standard procedures (Sambrook 
et al . , 1989) . 

A DNA of the parental lines mlo-8 and mlo-1 and two 
homozygous susceptible (S, Mlo Mlo) and two resistant (R, mlo 
mlo) progenies derived from two susceptible F 2 plants 

25 (designated 1 and 2) were tested. The DNAs in lanes S and R 

represent selection F 3 individuals from F 3 families obtained by 
selfing the susceptible F 2 individuals 1 and 2. Note that 
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susceptible F 2 individuals are expected to be heterozygous at 
Mlo in this section scheme. Infection phenotypes were scored 
seven days after inoculation with the mlo avirulent isolate Kl . 
DNA from a third susceptible individual of this heteroallelic 
cross (see Table 7) is not included in this Figure. 

B DNA of the parental lines mlo-5 and inlo-l and seven 
homozygous susceptible (S, Mlo Mlo) and seven resistant (R, mlo 
mlo) progeny derived from seven susceptible F 2 plants 
(designated 1 to 7) were tested. The DNAs in lanes S and R 
represent selected F 3 individuals from F 3 families obtained by 
selfing the susceptible F 2 individuals 1 to 7 . DNA was 
analyzed from two further susceptible individuals of this 
heteroallelic cross only in the F 2 generation (8* and 9*) . 

Figure 5 shows an alignment of genomic sequences covering 
the barley Mlo gene and a rice homologue isolated via 
crosshybridization with a barley gene specific probe. The top 
line shows the barley Mlo genomic DNA sequence (exon sequences 
underlined) . The bottom line shows the rice genomic sequence 
containing the rice Mlo homologue. 

Figure 6 shows an alignment of genomic sequences carrying 
the barley Mlo gene and a barley homologue isolated via 
crosshybridization with a barley gene specific probe. The top 
line shows the barley Mlo genomic DNA sequence (exon sequences 
underlined) . The bottom line shows the genomic sequence 
containing the barley Mlo homologue. 

Figure 7 Nucleotide and Deduced Amino Acid Sequence of the 
Barley Mlo cDNA. The nucleotide and the deduced amino acid 
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sequence are based on the combined data of RT-PCR and RACE 
obtained from experiments using RNA of cultivar Ingrid Mlo. 
The stop codon is marked by an asterisk, the putative 
polyadenylation signal is underlined and the detected termini 
of RACE products are indicated by arrows above the sequence. 
Positions of introns as indentified by comparison with 
corresponding genomic clones are labelled by triangles below 
the nucleic acid sequence. Six predicted transmembrane 
spanning helices according to the MEMSAT algorithm (Jones et 
al . , 1994) are boxed in grey colour. A putative nuclear 
localization signal (K-K-K-V-R) and casein kinase II site (S-I- 
F-D) in the carboxy- terminal half of the protein are shown in 
bold type. 

Figure 8 shows genomic sequence of rice (Oryza sativa) 
homologue including coding and flanking sequences. 

Figure 9 shows genomic sequence of barley (Hordeum 
vulgare) homologue including coding and flanking sequences. 

Figure 10 shows cDNA sequence of rice homologue. 

Figure 11 shows cDNA sequence of barley homologue. 

Figure 12 shows cDNA sequence of Arabidopsis thaliana 
homologue . 

Figure 13 shows amino acid sequence of rice homologue. 
Figure 14 shows amino acid sequence of barley homologue. 
Figure 15 shows amino acid sequence of Arabidopsis 
homologue . 

Figure 16 shows a pretty box of amino acid sequences of 
Mlo, barley, rice and Arabidopsis homologues. 
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All documents mentioned in this document are incorporated 
by reference. 



EXAMPLE 1 - CLONING OF MLO OF BARLEY 

5 

Targeted search for AFLP markers tightly linked to Mlo 

Efforts to increase the DNA marker density around Mlo were 
coordinated with attempts to construct a local high resolution 
genetic map. An alternative possibility would have been to 

10 extend the population size of the characterized cross Carlsberg 
II Mlo x Grannenlose Zweizeilige mlo-11 {Hinze et al . , 1991) 
but it was felt to be advantageous to establish a high 
resolution map starting out from one of the available 
BC mlo lines and its recurrent parent line. Importantly, the 

15 donor parent of the BC line represents a different genetic 

background in comparison to the recurrent parent line. In this 
way, searching for linked AFLP markers could be started in 
parallel with generating a large mapping population from a 
cross between the same genetic lines. In addition, the BC line 

20 based cross allowed testing of colinearity of DNA markers in 
the vicinity of Mlo as determined from the cross 
Carlsberg II Mlo x Grannenlose Zweizeilige mlo-11 (Hinze et 
al . , 1991) . For the new cross a mlo-3 backcross (BC) line was 
used that had been backcrossed seven times into the genetic 

25 background Ingrid (BC 7 Ingrid mlo-3 ; Hinze et al . , 1991). The 
line was previously characterized to carry a relatively small 
introgressed DNA segment on barley chromosome 4. In addition, 
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the donor parent line Malteria Heda mlo-3 exhibits in 
comparison to DNA from the recurrent parent Ingrid 
polymorphisms with most of the identified RFLP loci linked to 
Mlo. Thus, by searching polymorphisms only between two DNA 
5 templates, from lines Ingrid Mlo and BC^ Ingrid mlo-3, we hoped 
to increase the density of DNA markers with AFLPs around Mlo in 
a targeted manner. 

The same two lines were crossed to establish a segregating 
population for high resolution mapping of DNA markers, formally 

10 representing an eigth backcross. F 2 individuals were scored for 
mlo resistance after powdery mildew inoculation with isolate Kl 
(virulent on Ingrid Mlo and avirulent on BC 7 Ingrid mlo-3 ) . 
Initially, only a small fraction of the F 2 (77 individuals) was 
analyzed for recombination events with flanking RFLP markers. 

15 Analysis of four identified recombinants (designated 8-32-2, 7- 
38-4, 1-34-1, and 1-49-4) indicated colinearity of marker order 
in this cross compared to the previously analyzed cross 
Carlsberg II Mlo x Grannenlose Zweizeilige mlo-11 (Hinze et 
al., 1991) . Several of the 77 F 2 seedlings which exhibited a 

20 susceptible phenotype and heterozygosity for the tested 

flanking DNA marker loci (bAOll, bAL88/2, and bAP91; Hinze et 
al . , 1991) were grown to maturity to provide further selfed 
seed material segregating for Mlo/ mlo-3 in the F 3 generation. 
In total, leaf material was harvested for high resolution 

25 marker mapping from 2,026 individuals derived from either the 
selfed F 2 or F 3 generation. 

AFLP marker candidates were identified by testing all 
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possible Pst I/Mse I primer combinations (1,024) extending into 
genomic sequences up to nucleotide positions + 2 and +3, 
respectively. Similarly, almost 1,900 Eco RI/Mse I primer 
combinations (+3/ + 3) have been analyzed. Four DNA templates 
were included in this analysis: Ingrid Mlo, BC 7 Ingrid mlo-3 , a 
DNA pool of two phenotypically mlo resistant F 2 individuals, 
and a DNA pool of nine phenotypically susceptible F 2 
individuals. The resistant and susceptible F 2 individuals which 
were included as DNA pools in the AFLP search had been selected 
from the above mentioned RFLP analysis of 77 F 2 segregants. The 
pooled F 2 DNA enabled us to control whether candidate 
polymorphisms detected between template DNA from the parents 
were heritable traits in the F 2 . All identified AFLP candidate 
markers have been re-examined with eight DNA templates: Ingrid 
Mlo, BC 7 Ingrid mlo-3 , DNA pools from individuals of three F 3 
families which were phenotypically homozygous susceptible 
{MloMlo) according to Kl inoculation experiments; DNA of three 
resistant F 2 individuals. A total of 18 Pst I/Mse I and 20 
Eco RI/Mse I primers were confirmed based on the selection 
procedure . 

The number of identified AFLP markers made it useful to 
assign them first roughly to marker intervals based on the RFLP 
map around Mlo. It was hoped that this approach should enable 
both evaluation of the distribution of AFLPs among previously 
identified RFLP intervals close to Mlo and selection of a pair 
of flanking AFLP markers with which recombinants could be 
identified among the 2,026 segregants. For AFLP assignment we 
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used those four recombinants that had been identified with RFLP 
markers out of the above mentioned small sample of 77 F 2 
segregants from Ingrid Mlo x BC 7 Ingrid mlo-3 (two recombinants 
in interval bAP91-bAL88, one in Mlo-bAOll, and one in bAOll- 
ABG36 6) . A total of 18 AFLPs were found to be located within a 
genetic distance of approximately 3.5 cM including Mlo. 

Construction of a high resolution AFLP map around Mlo 

A two-step procedure was used to construct the high 
resolution AFLP map. First, all 2,026 segregants were screened 
for recombination events between two AFLP markers on opposite 
sides of Mlo and subsequently only the few identified 
recombinants were used to map all the identified AFLPs in the 
3.5 cM target interval. AFLP markers Bpml and Bpm9 were 
chosen, detecting each allelic DNA fragments in Ingrid Mlo and 
BC 7 Ingrid mlo-3 and located on opposite sites of Mlo to screen 
DNA templates of the segregants for recombination events. 
Alternatively, the search for recombinants could have been 
carried out with the flanking RFLP markers bAOll and bAL8 8 . 
However, although the conversion into cleaved amplified 
polymorphic sites (CAPS) was successful for both markers, 
difficulties to display the alleles of both loci simultaneously 
from crudely purified genomic DNA were encountered. A total of 
2,026 individuals (F 2 . or F 3 segregants) were screened 
simultaneously with AFLP markers Bpml and Bpm9 and 98 
recombinants were identified. AFLP analysis was subsequently 
carried out with each of the 98 DNA templates of the 
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recombinants to identify the alleles of each of the identified 
of AFLP loci . The recombinants have been self ed and 
inoculation experiments with powdery mildew isolate Kl were 
performed using at least 25 individuals of each recombinant 
family to deduce the alleles of the previous generation at the 
Mlo locus. The obtained data enabled the construction of a high 
resolution map around Mlo based on more than 4,000 meiotic 
events and a resolution of at least 0.025 cM derived via two- 
point estimates. The essential result has been the 
identification of a DNA marker cosegregating with Mlo (Bpml6) 
and two flanking markers (Bpm2 and Bpm9) at a distance of 0.25 
and 0.4 cM respectively (Figure 1). 

Construction of a large insert size barley YAC library, 
isolation of BpmlG containing YACs , and physical delimitation 
of Mlo 

The genetic evidence indicates that mlo resistance is due 
to loss of function in the Mlo wild type allele. Therefore, it 
was decided to establish a large insert size YAC library from 
cultivar Ingrid Mlo into vector pYAC4 (Burke et al . , 1987; 
Hieter, 1990) . Megabase DNA suitable for YAC cloning 
experiments was prepared in mg amounts from mesophyll 
protoplasts of five-day-old seedlings according to a modified 
protocol described by Siedler and Graner (1991) . The DNA was 
partially digested with Eco RI in the presence of Eco RI 
methyltransf erase to obtain DNA fragments after preparative 
pulsed-field gel electrophoresis (PFGE) in the size range of 
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500-600 kb. After ligation with Eco RI digested pYAC4 , the DNA 
was transformed into yeast strain AB1380 and colonies carrying 
recombinant pYAC4 DNA were selected on solidified synthetic 
complete medium lacking tryptophan and uracil (Sherman et al . , 
5 1986) . Forty randomly selected yeast colonies were tested for 

the presence of barley DNA using labelled barley genomic DNA in 
Southern experiments. The size of the YAC inserts was found 
after PFGE separations to vary between 500 and 800 kb. On 
average a genetic distance of 0.2 cM was expected to be 

10 represented on the individual recombinant YAC clone. A total of 
-40,000 clones representing four barley genome equivalents have 
been generated. 

Four YAC clones (designated 303A6, 322G2, 400H11, and 
417D1) have been isolated with marker Bpml6 cosegregating with 

15 Mlo. Their insert size was determined by PFGE to be 650 , 710, 
650, and 820 kb respectively. AFLP analysis had shown that 
three of these clones (303A6, 322G2, and 417D1) contain also 
both flanking marker loci whereas clone 400H11 contains only 
loci Bpml6 and Bpm2 . These findings strongly suggested that the 

2 0 Mlo gene had been physically delimited on recombinant YAC 
clones 303A6, 322G2, and 417D1. 

YAC 3 03A6 was chosen for subcloning experiments into BAC 
vector pECSBAC4 containing a unique Eco RI site (Shizuya et 
al. f 1992; the vector pECSBAC4 is described by Frijters and 

25 Michelmore, 1996; submitted) . Total yeast DNA of this clone was 
partially digested with Eco RI to obtain DNA fragments with an 
average size of 50 kb and ligated into Eco RI digested and 
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dephosphorylated BAC vector. Bacterial colonies containing 
YAC 3 03 A6 -derived DNA in pECSBAC4 were identified by replica 
colony hybridization experiments. One set of colony containing 
membranes was hybridized with labelled yeast AB1380 DNA and the 
replica set was hybridized with labelled PFGE-purif ied YAC303A6 
DNA. Recombinant BAC clones containing the AFLP locus Bpml6 
were subsequently identified using the cloned 108 bp 
Pst I/Mse I genomic Bpml6 fragment as a probe in colony 
hybridization experiments . 

One BAC clone, BAC F15, containing an insert of - 60 kb 
was chosen for further detailed studies. It was found that the 
recombinant BAC clone contained in addition the AFLP marker 
locus Bpm2, but not Bpm9 . At this point the BAC F15 insert DNA 
indicated successful physical delimitation in telomeric 
orientation but it was an open question whether the insert 
would contain bordering sequences in centromeric direction. 
Instead of constructing a BAC contig between Bpm 16 and Bpm9, 
the option to develop new polymorphic markers from BAC F15 was 
chosen. An allelic Xba 1/Mse I polymorphism (designated Bxm2) 
was identified between the parental lines Ingrid Mlo and 
BC 7 Ingrid mlo-3. 

An analysis of the 25 recombinant individuals carrying 
recombination events within the Mlo containing interval Bpm2- 
Bpm9 enabled mapping of Bxm2 in centrometric orientation at a 
distance of 0.1 cM from Mlo. Only four out of the 16 available 
recombinants in the interval Bpm9 -Mlo and none of the 9 
recombinants in the interval Mlo-Bpm2 were found to exhibit a 
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recombination event between Bxm2 and Mlo. It was concluded 
that Mlo had been physically delimited on BAC F15 between 
marker loci Bpm2 and Bxm2 (Figure 1) . 

Identification of the Mlo gene and mlo mutants 

A random sequencing project was initiated to determine 
sequence contigs of the -60 kb insert of BAC F15 before marker 
Bxxn2 was identified and shown to delimit the gene in telomeric 
orientation. In parallel, a physical map was generated 
(Figure 1) . The physical map indicated that the flanking 
markers Bpm2 and Bxm2 are physically separated by -3 0 kb. The 
sequence contigs were searched for regions of high coding 
probability using the UNIX versions of the STADEN program 
package. Only one sequence contig of almost 6 kb, including the 
cosegregating marker Bpml6, revealed an extensive region of 
high coding probability. 

RT-PCR reactions were performed with total leaf RNA 
derived from cultivar Ingrid Mlo using a series of primers 
deduced from regions which indicated high coding probabilities 
and obtained in each case a distinct amplification product. 
Sequencing of the largest RT-PCR products revealed a single 
extensive open reading frame of 1,602 bp (Figure 2). The 
deduced putative protein of 533 amino acids has a molecular 
weight of 6 0.4 kDal . The -1.7 kb RT-PCR product was used as a 
hybridization probe and detected a single RNA transcript of 
-1.9 kb length. (Figure 3). A comparison of the genomic 
sequence and the largest RT-PCR fragment reveals 12 exons and 
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11 introns, each flanked by the characteristic splice site 
sequences (Figure 1) . 

Because marker Bpml6 is located at the 3' end of the above 
described gene (exon 11) and cosegregates with the Mlo locus, 
we started a direct PCR sequencing of the various available 
mutagen- induced mlo resistance alleles. We identified in 14 
out of 15 tested mutant alleles nucleotide alterations which 
result either in single amino acid alterations, deletions or 
frame shifts of the wild type sequence {Table 1) . We suspect 
that mutant allele mlo-2 is located within the promoter- or 5' 
untranslated sequences. The region is notoriously difficult to 
be sequenced via direct PCR sequencing from genomic DNA 
templates but experiments using a series of nested primers are 
likely to solve this problem. In summary, the comparative 
sequencing of genomic DNA from various mutant mlo lines and 
their respective Mlo wild type ciltivars provided strong 
evidence that Mlo has been identified. 

In trageni c recombinants 

It had been the intention to provide a chain of evidence 
for the molecular isolation of Mlo which did not rely upon 
complementation experiments via transgenic barley plants. We 
had chosen to develop an unusual genetic tool to confirm that 
the identified gene represented Mlo. It was reasoned that if 
the mutations observed in the above described gene caused 
resistance to the powdery mildew fungus, recombination events 
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between mutant allele sites should restore wild type sequences. 

It was predicted that those intragenic recombinants would 

exhibit susceptibility upon powdery mildew attack. 

A crossing scheme was devised involving mlo resistance 

5 alleles mlo~l, mlo-5 r and mlo-8. The mutant alleles originate 

from the genetic backgrounds Haisa [mlo-D and Carlsberg II 

(mlo-5 and mlo-8) . Intermutant crosses were performed as shown 

in Table 2 generating in each case at least 10 F x plants. F 2 

- populations were obtained by self-fertilization. F 2 seedlings 

10 - were screened for rare susceptible individuals after 

inoculation with powdery mildew isolate Kl which is virulent on 

each of the parental Mlo wild type cultivars. Susceptible F 2 

-4 

individuals were identified with a frequency of -6 x 10 . In 
contrast, .if comparable numbers of progenies from selfings of 

15 each of the mlo mutants were tested for resistance to Kl , no 
susceptible seedling was identified. This finding strongly 
indicated that the majority of the susceptible individuals 
derived from the intermutant crosses were not due to 
spontaneous reversion events of the mutant mlo alleles. 

20 Inheritance of the susceptible F 2 individuals was tested 

after selfing in F 3 families. Each of the F2 individuals 
segregated susceptible and resistant F 3 individuals indicating 
hetrozygosity for alleles conferring resistance/susceptibility 
in the F 2 . Homozygous susceptible F 3 progeny were isolated for 

25 the majority of susceptible F 2 individuals by selfing of F 3 
individuals and subsequent identification of F 4 families in 
which only susceptible individuals were detected. 
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A molecular analysis of the susceptible individuals has 
been performed using RFLP markers known to be tightly linked 
(< 3 cM) on each side of the Mlo locus (Figure 4) . RFLP marker 
WG114 maps in centromeric orientation relative to Mlo, marker 
ABG366 maps in the direction of the telomere. Detected RFLP 
alleles are shown for the intermutant crosses mlo-8 x mlo-1 (A) 
and mlo-1 x mlo-5 (B) . DNA was analyzed either from susceptible 
F 2 individuals (indicated by *) or from homozygous susceptible 
(S) and homozygous resistant (R) F 3 progeny obtained from 
selfed susceptible F 2 individuals. 

The homozygous susceptible F 3 progeny from the susceptible 
F 2 plant #1 of cross 

mlo-8 x mlo-1 (Figure 4) reveals the WG114 allele derived from 
the mlo-1 parent in centromeric orientation next to Mlo and the 
ABG366 allele from the mlo-8 parent in telomeric orientation to 
Mlo. The homozygous resistant F 3 progeny from F 2 plant #r of 
this cross reveals in contrast only the flanking marker alleles 
derived from parent mlo-1. The finding strongly suggested that 
susceptibility in F 2 plant #1 is caused by a cross-over type of 
recombination in the preceding meiosis of one chromosome which 
results in a restoration of the Mlo wild type allele whereas 
the second F 2 chromosome of individual 1 contains a 
functionally unaltered mlo-1 allele. The allelotypes of the 
RFLP loci of the homozygous susceptible F 3 progeny from 
susceptible F 2 plant #2 are identical to the one described 
above. However, flanking marker alleles from the homozygous 
resistant F 3 progeny of this individual are in both cases 
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derived from the mlo-8 parent. It is concluded that again a 
cross-over type of recombination restored one Mlo wild type 
allele in the susceptible F 2 individual #2. 

Nine susceptible F 2 individuals were recovered from the 
cross mlo-1 x mlo-5 (Figure 4) . For susceptible F 2 individuals 
#1 to #7 both homozygous susceptible and homozygous resistent 
F 3 progeny were analyzed at the DNA level. Note that only DNA 
from the heterozygous susceptible F 2 individuals was analyzed 
in the case of individuals #8 and #9 (marked by a *) . The 
following allele patterns with respect to the flanking RFLP 
loci were observed: (i) homozygous resistant F 3 progeny showed 
on both sides of Mlo either only the allelotypes of loci WG114 
and ABG366 derived from the mlo-1 parent (individuals #1, #3, 
#6, #7) or only the allelotypes derived from the mlo-5 parent 
(individuals #2, #4, #5). (ii) Homozygous susceptible F 3 
progeny showed in contrast either only the allelotypes of both 
loci derived from the mlo-5 parent (no. #3, #5, #6) or they 
showed different allelotypes on both sides of Mlo (individuals 
#1, #2, #4, #7). (Hi) The homozygous susceptible F 3 progeny 
with different allelotypes on both sides always contain in 
centromeric orientation the mlo-l derived WG114 allele and in 
telomeric orientation the mlo-5 derived ABG366 allele, (iv) The 
heterozygous susceptible F 2 individual #8 reveals on either 
side next to Mlo only the alleles derived from parent mlo-5. 
The heterozygous susceptible individual #9 reveals in 
centromeric orientation alleles derived from both parents mlo-l 
and mlo-5 whereas only the mlo-5 derived allele is detected in 
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telomeric orientation. A comprehensive interpretation of the 
data suggests that susceptibility in F 2 individuals no. #1, #2 , 
#4, #7, and #9 is caused by a cross -over type of recombination 
restoring the Mlo wild type allele. Non cross-over types of 
recombination may have restored the Mlo wild type allele in 
individuals no. #3, #5, #6, and #8. 

A compilation of the detected flanking RFLP alleles of all 
isolated susceptible F 2 individuals or homozygous F 3 progeny is 
shown in Table 3. Note that individual #3 of the cross mlo-8 x 
mlo-1 is not shown in Figure 4. The compilation reveals that 
(i) cross-over types of recombination (CO) and non cross-over 
types of recombination (NCO) are found with a ratio of 7 : 5, 
Hi) cross-over types of recombination are resolved 
unidirectional, and (iii) NCO recombinants were not observed 
with parental mlo-2-linked RFLP alleles. 

The CO type intragenic recombinants isolated from 
heteroallelic mlo crosses were used to test whether wild type 
sequences of the Mlo candidate gene had been restored. For the 
three relevant alleles mlo-l t mlo-5, mlo-8 alleles candidate 
mutation sites have been identified (Table 1 and 4). Direct 
PCR sequencing of genomic DNA of susceptible intragenic 
recombinants derived from both heteroallelic crosses mlo- 2 x 
mlo-8 and mlo-1 x mlo-5 revealed restoration of wild type 
sequences (Table 4) . This observation strongly suggests that 
the intragenic cross over event occurred between nucleotide -1 
and +483 in the former and +3 and +483 in the latter cross 
(according to translat ional start site). Thus, the molecular 
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analysis of seven intragenic recombinants from two 
heteroailelic crosses provides final proof that the above 
described candidate gene represents Mlo. 

EXAMPLE 2 - H0MOL0GUES OF THE IDENTIFIED MLO GENE 

The available expressed sequence tag (EST) databases of 
Oryzae sativa (rice) and Arabidopsis thaliana were searched for 
homologous protein sequences. Five Arabidopsis cDNA clones were 
identified whose deduced amino acid sequences show substantial 
similarity to the Mlo protein. Remarkable is cDNA clone 
205N12T7 which reveals a chance probability of 1.2 e' 45 . In 
addition, at least one significant homologue was found in rice 
(OSR1S381A) . 

A rice BAC library (Wang et al . , 1995) has also been 
screened with a labelled barley genomic fragment containing 
Mlo. A BAC clone containing an insert of -23 kb was isolated. 
Subsequent subcloning enabled isolation of a 2 . 5 kb Pst I 
genomic rice fragment showing strong cross -hybridization with 
the barley Mlo gene probe. DNA sequencing of this fragment 
revealed remarkable DNA sequence similarities within exon 
sequences of the barley Mlo gene (Figure 5) . 

Finally, a 13 kb X genomic barley clone derived from 
cultivar Igri (Stratagene) was isolated with a labelled barley 
genomic fragment containing Mlo. The nucleotide sequence 
derived from a subcloned 2.6 kb Sac I fragment reveals again 
extensive sequence similarities to the Mlo gene (Fig. 6) . The 
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location of the barley Mlo homologue within the genome is not 
within BAC F15 DNA. 

In summary, there is conclusive evidence for Mlo 
homologues both in a monocotyledonous and a dicotyledonous 
plant species. 

Discussion 

Any speculation as to mode of action of Mlo and mlo 
nucleic acid and polypeptides should provide no limitation on 
the nature or scope of any aspect or embodiment of the present 
invention . 

In plants, resistance to pathogens is frequently 
determined by dominant resistance genes, whose products are 
assumed to recognize pathogen-derived avirulence gene - products . 
This mode of pathogen defence follows Flor's gene-f or-gene 
hypothesis (Flor, 1971) . Recently, several *gene-f or-gene' type 
resistance genes have been molecularly isolated (Martin et al . , 
1993; Bent et al . , 1994; Jones et al . , 1994; Mindrinos et al . , 
1994; Whitham et al . , 1994; Grant et al . , 1995; Lawrence et 
al., 1995; Song et al . , 1995). The surprising finding is that 
the deduced proteins share remarkable similar structural 
domains although they trigger resistance reactions to pathogens 
such as viruses, fungi, and bacteria (Dangl, 1995; Staskawicz 
et al., 1995). The isolated genes code for proteins that either 
contain a leucine- rich region (LRR) , with or without an 
attached nucleotide binding site (NBS) , indicative of ligand- 
binding and protein-protein interaction or encode a simple 
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serine/threonine kinase. A structural combination of LRR and 
the kinase domain has been reported in the deduced protein from 
the rice Xa21 resistance gene (Song et al . , 1995). The 
structural similarity of resistance genes in ' gene- f or-gene ' 
5 defence makes the existence of a common underlying resistance 
mechanisms likely . 

Resistance mediated by recessive resistance alleles of the 
Mlo gene differs in various aspects from ' gene- f or-gene ' 
resistance (see introductory comments above) . The molecular 

10 isolation of the Mlo gene and the sequencing of various 

mutation- induced znlo alleles described here, confirms previous 
interpretations from combined mutational and Mendelian genetic 
studies (Hentrich, 1979; Jergensen, 1983). It is concluded that 
defective alleles of the Mlo locus mediate broad spectrum 

15 resistance to pathogens such as the powdery mildew pathogen. 
This is inconsistent with the involvement of a specific 
recognition event of a pathogen-derived product as has been 
proposed for race-specific resistance genes. 

Pleiotropic effects of mlo alleles have provided some 

2 0 clues towards the development of a molecular concept of the 
observed broad spectrum resistance response. 

Firstly, aseptically grown mlo plants exhibit at a high 
frequency a spontaneous formation of cell wall appositions 
(CWAs) in leaf epidermal cells (Wolter et al . , 1993). Those 

2 5 CWAs are usually formed in response to attempted pathogen 

penetration directly beneath the fungal apressorium. CWAs are 
believed to form a physical barrier against pathogen ingress 
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and have been implicated repeatedly in mlo mediated resistance 
(Bayles , 1990) . 

Secondly, at a later stage, the plants develop 
macroscopically detectable leaf necrotic flecks. The 
5 spontaneous leaf necrosis response has been extensively studied 
with a unique collection of 95 chemically- induced mlo alleles 
(Hentrich, 1979) . The alleles were classified as either showing 
a gradually different infection phenotype upon infection of a 
mixture of nine powdery mildew isolates. Those mlo alleles 
10 which give rise to an intermediate infection phenotype (i.e. 
development of a considerable number of sporulating fungal 
colonies upon inoculation) showed no detectable spontaneous 
leaf necrosis whereas the category of the most effective 
resistance alleles exhibits pronounced necrosis in the"~absence 
15 of the pathogen. Thus, there is solid evidence that the former 
category of mlo alleles retain residual wild type allele 
activity and those alleles appear to exhibit no detectable 
spontaneous leaf necrosis. 

Thirdly, a constitutive expression of defence-related 
20 genes has been observed in mlo seedlings grown under mildew- 
free conditions - in primary leaves when 10-11 days old; this 
includes genes of the PR-1 family, chitinases and peroxidases. 

We have shown that mlo in barley confers increased 
resistance to different types of yellow rust (Puccinia 
25 strucif ormis) when a one to one mixture of talcum powder and 
spores were aviblown onto leaves of mlo barley plants after 
onset of constitutive expression of defence related genes (10- 



WO 98/04586 



PCT/GB97/02046 



83 

11 day old mlo seedlings) . 

Thus, it appears that multiple defence-associated 
responses are constitut ively expressed in mlo plants. 

The temporal relationship of these events is interesting: 
5 the onset of constitutive defence-related transcript 

accumulation is detected in 11 day-old seedlings and precedes 
CWA formation which is followed by the appearance of 
macroscopically visible leaf necrosis. Importantly, however, 
mlo resistance can be experimentally tested as early as in five 
10 day-old seedlings and is fully functional at this time. We 
conclude that the Mlo protein has a negative regulatory 
function in plant defence and that plants with a defective 
protein are 'primed' for the onset of defence responses. 
The deduced amino acid sequence of Mlo reveals no 
15 significant homologies to any of the described plant resistance 
genes so far, supporting the idea of a distinct molecular 
resistance mechanism. The Mlo gene shows also no striking 
similarities to any characterized plant or mammalian gene 
sequence in the various data bases. However, highly significant 
2 0 homologous sequences have been identified in the EST and 
genomic databases both from rice and Arabidopsis thallana 
(Table 5 and Figure 5) . This strongly suggests that the Mlo 
protein represents a member of a novel protein family. A 
putative nuclear localization motif (NL.S) is found within exon 
25 12 providing indication of nuclear localization of the protein 
(KEKKKVR; Nigg et al . , 1991). The significance of this motif is 
supported by a casein kinase II motif located 14 amino acids 
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into direction of the NH 2 - terminus (SIFD; Rihs et al . , 1991). 
Functional tests may examine the putative subcellular 
localization of the Mlo protein. 

Mutations have been described also in other plant species 
in which defence responses to pathogens appear to be 
const itutively expressed (Walbot et al . , 1983; Pryor, 1987; 
Jones, 1994). It has been suggested that this class of mutants, 
termed lesion mimics (Les) or necrotic mutants (nec) , affect 
the control of plant defence responses. Recessively inherited 
lesion mimic mutants have been systematically analysed in 
Arabidopsis thaliana (Greenberg and Ausubel, 1993; Dietrich et 
al., 1994; Greenberg et al . , 1994; Weymann et al . 1995). The 
affected genes have been designated acd (accelerated cell" 
death; acdl and acd2) or lsd (lesions simulating disease 
resistance response; lsdl to lsd7) , 

Each of the mutants exhibits, in the absence of pathogens, 
HR characteristics such as plant cell wall modifications and 
the accumulation of defence-related gene transcripts. Leaves of 
the acd2 mutant have been shown to accumulate high levels of 
salicylic acid and of the Arabidopsis phytoalexin, camelexin 
(Tsuji et al., 1992). Importantly, acd and lsd mutants exhibit 
elevated resistance to a bacterial (P. syringae) and fungal (P. 
parasitica) pathogen. The lsdl mutant is exceptional in that it 
confers heightened pathogen resistance at a prelesion state, in 
contrast to the other defective loci which exhibit elevated 
pathogen resistance only in the lesion-positive state. In this 
respect, lsdl resembles the nilo mutants in barley. Another 
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striking feature of Isdl is the indeterminate spread of lesions 
in contrast to the other mutants where lesion growth is 
determinate . 

EXPERIMENTAL PROCEDURES 

Plant Material 

A compilation of the mlo mutants and their mother 
varieties analyzed in this study has been described by 
Jargensen (1992) [mlo-1, mlo-3 , mIo-4, mlo-5, mlo-1 , mlo-8, 
mlo- 9, mlO'lOr mlo- 11] and by Habekuss and Hentrich (1988) 
[mutants in cultivar Plena 2018 (mlo-13), 2034 (znio-17) , 2118]. 
Since mutant 2118 has not been assigned to an allele number so 
far, we designate the allele here as mlo- 26 , according to 
current numbering in the GrainGene database 

(gopher : //greengenes . cit . Cornell . edu : 70/77/ . graingenes . ndx/ 
index?mlo) . 

The high resolution map is based on a cross between Ingrid 
Mlo x BC 7 Ingrid mlo- 3. F x plants were selfed generating a 
segregating F 2 population of approximately 600 plants. 
Phenotypically susceptible F 2 plants which showed 
heterozygosity for RFLP markers on opposite sites of Mlo were 
selfed and generated further segregants in the F 3 generation 
for high resolution mapping. 

Powdery Mildew Infection Tests 

The fungal isolate Kl (Hinze et al., 1991) is virulent on 
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all cultivars used in this study carrying the Mlo allele and 
avirulent on all tested mlo genotypes. Plant growth and 
inoculation with Erysiphe grztminis f sp hordei were carried out 
as described previously (Freialdenhoven et al . , 1996). The 
genotype at Mlo of recombinants used for the high resolution 
map were determined after selfing and subsequent inoculation 
experiments in F 3 or F 4 families comprising at least 24 
individuals . 

AFLP Analysis 

Genomic DNA for AFLP analysis was isolated according to 
Stewart and Via (1993) . AFLP analysis was carried out with 
minor modifications as described by Vos et al. (1995). For 
screening of AFLP markers linked to Mlo we used the enzyme 
combinations Pst I/Mse I with amplification primers carrying +2 
and +3 selective bases respectively in genomic sequences of 
amplified fragments. For Eco RI/Mse I amplification primers we 
used +3 and +3 selective bases respectively. A set of four DNA 
templates has been used: from the susceptible parent cultivar 
Ingrid Mlo, the resistant parent BC 7 lngrid mlo-3 f a pool of two 
resistant F 2 individuals (mlo-3 mlo-3) and a pool of nine 
susceptible F 2 individuals {Mlo Mlo) derived from the cross 
Ingrid Mlo x BC 7 Ingrid mlo-3. Amplified genomic fragments 
representing AFLP markers Bpm2, Bpm9 , and Bpml6 (Figure 1) were 
cloned and sequenced as follows: gel pieces (fixed by vacuum 
drying to Whatman 3 MM paper) containing the amplified genomic 
fragments were identified via autoradiography and subsequently 
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excised. 100 /il water were added, boiled for 10 min, and after 
centrif ligation 5 fxl of the supernatant were used as a template 
for non- radioactive reamplif ication (30 cycles) with the 
selective AFLP primers. Amplification products were isolated 
5 after agarose gel using a DNA isolation kit (Jetsorb, Genomed 
Inc., USA) * DNA was reated with Klenow polymerase and T4 
polynucleotide kinase and subsequently cloned in the EcoRV site 
of pBluescript SK (Stratagene) . Sequencing reactions were 
performed using a dye terminator cycle sequencing reaction kit 
10 (Perkin Elmer) and resolved either on an ABI 373 or 377 
(Applied Biosystems) automated sequencer. 

Barley YAC Library and BAC Sublibrary Construction of YAC 
YHV3 03-A6 

15 The YAC library of barley cultivar Ingrid was established 

using the pYAC4 vector (Burke et al . , 1987; Kuhn and Ludwig 
1994) and yeast" strain AB 1380. Details of the library 
construction and its characterization will be described 
elsewhere. Screening for YAC clones containing marker BpmlG 

20 was done by AFLP analysis. For construction of a BAC 

sublibrary of YAC YHV303-A6, total DNA of this yeast clone was 
used. After partial Eco RI digestion and preparative pulsed- 
field gel electrophoresis, DNA fragments in the size range of 
50 kb were recovered and subcloned in the pECSBAC4 vector. 

25 Clones carrying YHV303-A6 derived inserts were identified by a 
two-step colony hybridization procedure. First total labelled 
DNA of the non- recombinant yeast strain AB 1380 was used as a 
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probe to eliminate most of the clones carrying insert DNA 
derived from the host strain. In a subsequent hybridization 
step the remaining clones were probed with the labelled 
recombinant chromosome YHV3 03-A6 after enrichment by 
preparative pulsed-field gel electrophoresis. 

DNA Sequencing of BAC F15 

DNA of BAC F15 was isolated by an alkaline lysis large 
scale plasmid preparation according to Sambrook et al . (1989) . 
50 fig of purified DNA were nebulized by high pressure treatment 
with argon gas in a reaction chamber for 150 seconds. The ends 
of the sheared and reprecipitated DNA were blunt -ended by a T4 
DNA polymerase -mediated fill in reaction. DNA fragments in the 
size range between 8 00 bp and 3 kb were isolated from "agarose 
gels using a DNA isolation kit (Jetsorb, Genomed Inc., U.S.A.), 
subcloned into the pBluescript SK vector (Stratagene) and 
propagated in £. coli DN5a . Clones carrying BAC F15 derived 
inserts were selected by hybridization using the sheared DNA of 
BAC F15 as a probe. Sequencing reactions were performed as 
described above. Evaluation of the sequencing data, 
construction of sequence contigs, and estimation of coding 
propabilities were done by means of the STADEN software package 
for Unix users (4th edition, 1994) . Assessment of coding 
probabilities was based on a combined evaluation of uneven 
positional base frequencies, positional base preference and 
barley codon usage in the investigated contigs. Homology 
searches were done using the BLAST software. 
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PCR-based Sequencing of Alleles at Mlo 

Plant chromosomal DNA for this purpose was isolated 
according to Chunwongse et al . (1993). DNA sequences of Mlo 
alleles of the different barley varieties, mlo mutants, BC 
5 lines, and intragenic recombinants used in this study were 
obtained by PCR-based sequencing. Seven overlapping 
subfragments of the gene (each 400 bp- 600 bp in length) were 
amplified by PCR (35 cycles, 60 "C annealing temperature) using 
sets of specific primers. After preparative agarose gel 

10 electrophoresis and isolation of the amplification products 
using the Jetsorb kit (Genomed Inc., U.S.A.) fragments were 
reamplif ied to increase specificity. The resulting products 
were subsequently purified from nucleotides and 
oligonucleotides (Jetpure, Genomed Inc., U.S.A.) and used as a 

15 template in DNA sequencing reactions (see above) . All DNA 

sequences of mutant alleles and corresponding regions of the 
parental lines and the intragenic recombinants were derived 
from both strands and confirmed two times in independent sets 
of experiments. In addition, mutant alleles mlo-1, mlo-3 , mlo- 

20 4, mlo-5, mlo-1 , mlo- 8, mlo-3, and mlo-10 were also verified in 
the corresponding BC lines in cultivar Ingrid. 

RT-PCR and Rapid Amplification of cDNA Ends (RACE) 
RT-PCR was performed using the SUPERSCRIPT 
25 preamplif ication system for first strand cDNA synthesis (Gibco 
BRL) . Total RNA (1 jzg) of seven-day-old primary barley leaves 
(cultivar Ingrid) served as template. First strand cDNA 
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synthesis was primed by an oligo(dT) primer. The putative 
coding region of the Mlo gene was subsequently amplified using 
oligonucleotides 25L (GTGCATCTGCGTGTGCGTA) and 3 8 
(CAGAAACTTGTCTCATCCCTG) in a single amplification step (35 
cycles, 60*C annealing temperature). The resulting product was 
analyzed by direct sequencing, 5'- and 3 '-ends of the Mlo cDNA 
were determined by RACE (Frohman et al., 1988) using the 
MARATHON cDNA amplification kit (Clontech) . Corresponding 
experimental procedures were mainly carried out according to 
the instructions of the manufacturer. To obtain specific RACE 
products, two consecutive rounds of amplification (35 cycles, 
55 # C annealing temperature) were necessary. For this purpose, 
two sets of nested primers were used in combination with the 
adapter primers of the kit: oligonucleotides 46 

(AGGGTCAGGATCGCCAC) and 55 (TTGTGGAGGCCGTGTTCC) for the 5' -end 
and primers 3 3 (TGCAGCTATATGACCTTCCCCCTC) and 3 7 
(GGACATGCTGATGGCTCAGA) for the 3' -end. RACE products were 
subcloned into pBluescript SK (Stratagene) . Ten 5' -end and 
eight 3' end clones were chosen for DNA sequence analysis. 

The term "AFLPs" is used herein to refer to "AFLP 
markers" . 
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Table 1 summarizes the identified mutation sites of 
various mutants within the Mlo gene. The origin, the mutagen 
and the predicted effect of the mutation at the amino acid 
level are indicated. 
5 Table 2 shows the results of heteroallelic mlo crosses and 

selfings of the respective mlo lines to isolate intragenic 
recombinat ion events . 

Table 3 summarizes the genotypes at flanking RFLP markers 
in susceptible F 2 or homozygous F 3 progeny from the intermutant 
10 crosses. CO and NCO indicate crossover type and non crossover 
type recombinants deduced from flanking molecular marker 
exchange. Table 3 summarizes DNA sequence analysis of 
suceptible intragenic crossover type recombinants (from 
homozygous susceptible F 3 progeny) and the corresponding 
15 parental mlo mutant lines. Sequences flanking the identified 
mutation sites are shown. 

Table 4 shows the results of direct PCR sequencing of 
genomic DNA of susceptible intragenic recombinants derived from 
both heteroallelic crosses mlo-1 x mlo-8 and mlo-1 x mlo-5, 
20 revealing restoration of wild type sequences. 

Table 5 shows several Arabidopsis thaliana and two rice 
expressed sequence tags (ESTs) with homology to the Mlo 
protein . 

Table 5A show amino acid sequences, with "query" 
25 indicating part of the Mlo protein sequence to which homology 
has been found, with the predicted amino acid sequence of each 
identified EST marked with "subject". 
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Table 5B shows EST nucleotide sequences encoding the amino 
acid sequences shown in Table 5A. GenBank Accession number 
T22145 (definition 4153 Arabidopsis thaliana cDNA clone 97N8T7, 
NCBI Seq ID 932185) , number T22146 (definition 4153 Arabidopsis 
5 thaliana cDNA clone 97N9T7, NCBI Seq ID 932186), number N37544 
(definition 18771 Arabidopsis thaliana cDNA clone 205N12T7, 
NCBI Seq ID 1158686) , number T88073 (definition 11769 
Arabidopsis thaliana cDNA clone 155I23T7, NCBI Seq ID 935932) 
number H76041 (definition 17746 Arabidopsis thaliana cDNA clone 

10 193P6T7, NCBI seq ID 1053292), number D24287 (rice cDNA partial 
sequence R1638_1A, nID g428139) and D24131 (rice cDNA partial 
sequence R1408_1A, nID g427985) are shown. The Arabidopsis 
sequences are from Newman et al . (1994) Plant Physiol. 106 
1241-55. The rice sequences are from Minobe f Y. and Sasaki, T. 

15 submitted 2 Nov 1993 to DDBJ . 
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Table 2 



F2 progeny from intermutant crosses and selfings 



Testcrosses resistant susceptible frequency of 

susceptible F2 progeny 



mlo-8xmlo-1 5,281 3 5.7 x 10" 4 

mlo-5 x mlo-1 915 0 

mlo-5 x mlo-1 14.474 9 6.2 x10" 4 



selfings resistant susceptible 



mlo-1 12,634 0 

mlo-5 5,498 0 

mlo-8 8,435 0 
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TABLE 5A 



>EM EST1:AT1452 T22145 4153 Arabidopsis thaliana cDNA clone 97N8T7 . 11/95 
Length - 382 

Plus Strand HSPs: 

Score « 248 (115,9 bits) , Expect - 2.9e-27, P « 2.9e-27 
Identities «= 47/100 (47%) , Positives - 67/100 (67%), Frame « +2 

Query: 242 KriKRSMEDDFKVVVGrSLPLWGVAILTLFLDINGVGTLIWrgFrPLVXLLCVGTKLEMI 301 

KY+ R++EDDFK WGIS LW ++ L++NG T WI+FIP +LL VGTKLE + 
Sbjct: 2 KYMMRALEDDFKQ WG I S WYLWXF WIFXLLNVNGWHT YFWIAF IP FXLLLAVGTKLEH V 181 

Query: 302 IMEMALEIQDRASVIKGAPWEPSNKFFWFHRPDWVLFFI 341 

I ++A E+ ++ I+G W+P . + FKF +P VL+ I 
Sbjct: 182 IAQLAHEVAEKHVAIEGDLWKPXXEHFWFSKPQIVLYLI 301 



>EM EST1:AT1462 T22146 4154 Arabidopsis thaliana cDNA clone 97K9T7 . 11/95 
Length - 390 

Plus Strand HSPs: 

Score - 212 (99.1 bits), Expect - 4.2e-26 r Sum P(2) « 4.2e-26 
Identities « 41/83 (49%) , Positives - 58/83 (69%) , Frame - +2 

Query* 242 KYIKRSMEDDFKVWGISLPLWGVAILTLFLD INGVGTLIWI SFIPLVILLCVGTKLEMI 301 

K*+ R++EDDFK WGIS LW ++ L L++NG T WI+FIP +LL VGTKLE + 
Sbjct: 2 KYMMRALEDDFKQWGISWYLWXFWIFLLLNVNGWHT 181 

Query: 302 IMEMALEIQDRASVIKGAPWEP 324 

I ++A E+ ++ I+G W+P 
Sbjct: 182 IAQIAHEVAEKHVAIEGDLWKP 250 

Score - 52 (24.3 bits), Expect - 1.9, Sum P.(2) - 0.85 
Identities - 9/32 (28%) , Positives - 16/32 (50%) , Frame - +2 

Query: 18 WAVAWFAAMVLVSVLMEHGLHKLGHWFQHRH 49 

W + FA ++ V +EH + +L H +H 
Sbjct: 122 WIAFIPFALLLAVGTKLEHVIAQLAHEVAEKH 217 



Score - 49 (22.9 bits), Expect - 4.2e-26, Sum P(2) - 4.2e-26 
Identities - 8/17 (47%), Positives - 12/17 (70%), Frame « +1 

Query: 323 EPSNKFFWFHRPDWVLF 339 

E S++ FWF +P VL+ 
Sbjct: 244 ETSDEHFWFSKPQXVLY 294 
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TABLE 5A cont'd 



>EM EST1:AT54418 N37S44 18771 Arabldopsis thaliana cDNA clone 20SN12T7. 1/96 
Length - 585 

Plus Strand HSPs: 

Score - 277 (129*5 bits), Expect - 1.2e-45, Sum P(2) ■= 1.2e-45 
Identities = 51/96 (53%), Positives - 71/96 (73%), Frame ■= +1 

Query: 236 SKFDFHKYTKRSMEDDFKVVVGISI*PLWGV^ 295 

S+FDF KYI+RS+E DFK W.IS +W VA+L L + G+ + +-W4- FIPLV++L VG 
Sbjct: 127 SRFDFRKYIQRSLEKDFKTVVEISPVrWFVAVLFLLTNSyGLRSYIiWLPFrPLVVILrVG 306 

Query: 296 TKLEMIIMEMALEIQDRASVIKGAPWEPSNKFFWF 331 

TKLE-f II -H- L IQ+ V-f+GAPW+P + FWF 
Sbjct: 307 TKLEVIITKLGLRIQEEGDWRGAPVVQPGDDXFWF 414 

Score - 121 (56.6 bits), Expect « 1.2e-4S, Sum P(2) • 1.2e-45 
Identities « 25/45 (55%), Positives - 29/45 (64%), Frame « +1 

Query: 196 SSTPGIRWWAFFRQFFRSVTKVDYLTLRAGFINAHLSQNSKFDF 240- 

S T W+V FFRQFF SVTKVDYL L GFI AH + ++ F 
Sbjct: 1 SKTRVTLWrvCFFRQFFGSVTKVDYIALXHGFIMAHFAPGNESRF 135 



>EM EST1:AT04117 H76041 17746 Arabidopsis thaliana cDNA clone 193P6T7. 11/95 
Length - 476 

Plus Strand HSPs: 
Score - 210 (98.2 bits), Expect - 9.0e-36, Sum P(2) - 9.0e-36 
Identities « 43/86 (50%), Positives - 58/86 (67%), Frame - +1 



Query 
Sbjct 
Query 
Sbjct 



196 SSTPGIRWWAFFRQFFRSVTKVDYLTLRAGFINAHLSQNSECFDFHKYIKRSMEDDFKVV 255 

-H-TP V FFRQFF SV + DYLTLR GF +AHL+ KF4-F +YIK S+EDDFK+V 
124 TTTPFXFNVrcFFRQFFVSVERTDYLTI^GFXSAHIJu?^ 303 

256 VGISLPLWGVAILTLFLDINGVGtLI 281 

VGI LW ++ L + +GT++ 
304 VGIXPVLWASFVIFLAVQX*WLGTIV 381 



Score - 119 (55.6 bits), Expect - 9.0e-36, Sum P(2) - 9.0e-36 
Identities - 24/57 (42%), Positives - 32/S7 (56%), Frame - +1 

Query; 156 MRTWKKNETETTSLEYQFANDPARFRFTHQTSFVKM 212 

+R KKKHE T S +Y F D +R R TH+TSFV+ H +T + V F + F 
Sbjct: 1 IRGWKKWEQXTLSNDYXFXIDHSRLRLTHETSFVREHTSFWTTTPFXFNVGCFFRQF 171 



Score - 40 (18.7 bits), Expect - 1.2e-08, Sum P(2) - 1.2e-08 
Identities - 8/19 (42%), Positives - 10/19 (52%), Frame - +2 

Query: 269 TLFLDINGVGTLIWISFIP 287 
+L+WGGLW$P 

Sbjct: 344 S L LFNXKGWGP LFW AS VP P 400 
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TABLE 5A cont r d 



>EM EST1:AT0739 T88073 11769 Arabidopsis thaliana cONA clone 1S5I23T7. H/95 
Length - 460 

Plus Strand HSPs : 

Score ■= 175 (81.8 bits) , Expect « 1.2e-24, Sura P(2) - 1.2e-24 
Identities - 31/67 (<6%) , Positives «= 43/67 (64%), Frame -= +1 

Query: 14 6 VITIALSRLKMRTWKK^TETTSLEYQFAITOPARFRFTHQTSFVKRHLGLSSTPGIRWVV 205 

++T A ++KKRTWK WE ET ++EYQ-M-NDP RFRF TSF +RHL S + + 
Sbjct: 4 rVTYAFGKIKMRTWKSWEEETKTIEYQYSNDPERFRFARDTSFGRRHLMreSKTRVTLWI 183 



Score - 121 (56.6 bits), Expect ■= 1.4e-14, Sura P(2) *= 1.4e-14 
Identities «= 25/45 (S5%) , Positives «= 29/45 (64%), Frame « +1 

Query: 196 SSTPG IRWVVAFFRQFFRS VTKVD YLTLRAGF INAHLSQNSKFDF 240 

S T W4-V FFRQFF SVTKVDYL L GFI AH + ++ F 
Sbjct: 157 SKTRVTLWIVCFFRQFFGSVTKVDYLALXHGFIMAHFAPGKESRF 291 

Score « 75 (35.1 bits), Expect « 1.2e-24, Sura P(2) « 1.2e-24 
Identities « 14/21 (66%), Positives *= 17/21 (80%), Frame - +1 

Query: 236 SKFDFHKYIKRSMEDDFKVW 256 

S+FDF KYI+RS+ DFK W 
Sbjct: 283 SRFDFRKYIQRSLXXDFKTW 345 



>EM EST5:OSR16381A D24287 Rice cOKA, partial sequence (R1638_1A) . 5/95 
Length • 400 
Plus Strand HSPs: 

Score - 147 (68.7 bits), Expect ■» l.9e-16, Sum P(2) « 1.9e-16 
Identities «= 26/S3 (49%), Positives - 3S/53 (66%), Frame « +1 

Query: 236 SKFOFHKYIKRSMEDDFKWVGISLPLWGVAILTLFLDINGVGTLIWISFIPL 288 

++F+F KYIKR +EDDFK WGIS P W A+ + +++G L W S PL 
Sbjct: 202 TRF^RKYIKRXLEDDFKT\A^ISAPXWASAIAIMLFNvKGWHNLFWFSTXPL 360 

Score - 45 (21.0 bits), Expect - 1.9e-16, Sum P(2) « 1.9e-16 
Identities - 9/15 (60%), Positives - 11/15 (73%), Frame - +2 

Query: 287 PLVTLLCVGTKLEMI 301 

PL + L VGTKL+ I 
Sbjct: 356 PLXVTLAVGTKLQAI 400 



>EM ESTS:OSS1692A D39989 Rice cDNA, partial sequence (S1692 1A) . 11/94 
Length - 343 ~ 

Plus Strand HSPs: 
Score - 95 (44.4 bits), Expect - 0.00059, P - 0.00059 
Identities - 24/58 (41%), Positives - 31/58 (53%), Frame - +3 

Query: 43 HWFQHRHKKALWEALEKMKAELMLVGFISLLLIVTQDP 1 1 AKIC ISEDAAD VMKPCKR 100 

H + H+ L +A+EKMK E+ML4-GF I SLLL T I S+ PC R 

Sbjct: 3 HXSEKTHRNPLHKAMEKMKEEMMLLGFISLLLAATSRIISGICIDSKYYKSNFSPCTR 176 
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TABLE 5B 100 



GenBank Accession Number T22145 



1 caagtatatg atgcgcgctc tagaggatga tttcaaacaa gttgttggta ttagttggta 

61 tctttggntc tttgtcgtca tctttttnct gctaaatgtt aacggatggc acacatattt 

121 ctggatagca tttattccct ttnctttgct tcttgctgtg ggaacaaagt tggagcatgt 

181 nattgcacag ttagctcatg aagttgcaga gaaacatgta gccattgaag gagacttagt 

241 ggtgaaaccc ncanatgagc atttctggtt cagcaaacct caaattgttc tctacttgat 

301 cccattttat cctctttccc agaatgcntt ttnagantgc nttttttnnt tttggnnttt 

361 ggggtaanan annggtttcg nc 



GenBank Accession Number T22146 

1 caagtatatg atgcgcgctc tagaggatga tttcaaacaa gttgttggta ttagttggta 

61 tctttggntc tttgtcgtca tctttttgct gctaaatgtt aacggatggc acacatattt 

121 ctggatagca tttattccct ttgctttgct tcttgctgtg ggaacaaagt tggagcatgt 

181 nattgcacag ttagctcatg aagttgcaga gaaacatgta gccattgaag gagacttagt 

241 ggtgaaacct cagatgagca tttctggttc agcaaacctc aaantgttct ctactngatc 

301 cnctttatcc cccttccaga atgccttttt nangattcnn ntttttcctt nttgganntt 

361 ttgggnnnnc aaacgggntt nggacctccg 



GenBank Accession Number N37544 



1 agcaagacga gagtcacact atggattgtt tgttttttta gacagttctt tggatctgtc 
61 accaaagttg attacttagc actaagncat ggtttcatca tggcgcattt tgctcccggt 
121 aacgaatcaa gattcgattt ccgcaagtat attcagagat cattagagaa agacttcaaa 
181 accgttgttg aaatcagtcc ggttatctgg tttgtcgctg tgctattcct cttgaccaat 
241 tcatatggat tacgttctta cctctggtta ccattcattc cactagtcgt aattctaata 
301 gttggaacaa agcttgaagt cataataaca aaattgggtc taaggatcca agaggaaggt 
361 gatgtggtga gaggcgcccc agtggttcag cctggtgatg accncttctg gtttngnaan 
421 cacgnttcaa tnttttccnt antcacttng gcctttttan gggtgaattt caacttcatn 
481 ctttncctgg ggncggatga ttcaatccaa naatnttccc ctgaagnctn caagtttggg 
541 cataggcttt nggtgggntt ttcaganttt nagtttggct tnccc 
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TABLE 5B (Continued) 



GenBank Accession Number T88073 



1 tgcattgtta cttatgcttt cggaaagatc 
61 gagacaaaga caatagagta tcagtattcc 
121 gacacatctt ttgggagaag acatctcaat 
181 attgtttgtt tttttagaca gttctttgga 
241 agncatggtt tcatcatggc gcattttgct 
301 aagtatattc agagatcatt agngnaagac 
361 tatctggttt gtcggctgtg ctattccnct 
421 tggtaccatt attcnctagc ggaatntaaa 



aagatgagga cgtggaagtc gtgggaggaa 
aacgatcctg agaggttcag gtttgcnagg 
ttctggagca agacgagagt cacactatgg 
tctgtcacca aagttgatta cttagcacta 
cccggtaacg aatcaagatt cgatttccgc 
ttcaaaaccg ttgtttgaaa tcagtccggt 
tgaccaattc atatggntnc ggtnttncnc 
agttggcnga 



GenBank Accession Number H76041 



1 attcgtggat ggaaaaagtg ggagcaagan 
61 gatcattcaa gacttaggct cactcatgag 
121 tggacaacaa cncctttctn ctttaacgtc 
181 gtngaaagaa ccgactactt gactctgcgc 

2 41 ggaagaaagt tcaacttcca gagatatatc 
301 gtagttggaa taagnccagt tctttgggca 

3 61 taatggctgg ggaccattgt tttgggcntc 
421 ttggccaagg ttcaaggaat ttngggacaa 



acattatcta atgactatna gtttnctatt 
acttcttttg tnagagaaca tacaagtttc 
ggatgcttct ttaggcagtt ctttgtatct 
catggattca nctctgccca tttagctcca 
aaangatctc tcgaggatga ttztcaagttg 
tcatttgtaa tcttccttgc tgttcaatgn 
ggtaccgcct ntactcanaa ncccaggctt 
tggggtagaa tcgtgggcnc atnngg 



GenBank Accession Number D2A287 

1 tcntntttnn ttttcgnntn cntccacccc tnnnntnctc nancncnttn nnnttatctc 
61 tnttnttntc ncntntcccn ncaccaccnn ncgacgggcn tggactnngc ccnnngttcg 
121 aggctgccca ctgncgtctg agacctacct tgncatttga cggcacngga cttcanttgc 
181 tgctcacttt atctctacgg gactaggttc aattttcgga aatacatcaa aaggncactg 
241 gaggacgatt ttaagacagt tgttggcatt agtgcacccn tatgggcttc tgcgttggcc 
301 attatgctct tcaatgttca tggatggcat aacttgttct ggttctctac aatncccctt 
361 gntagtaact ttagcagttg gaacaaagct gcaggctata 



GenBank Accession Number D24131 

1 cagactacct gactttgagg cacggattca 
61 tcaattttcg gaaatacatc aaaaggtcac 
121 ttagtgcacc cttatgggct tctgcgttgg 
181 ataacttgtt ctggttctct acaatccccc 
241 tgcaggctat aattgcaatg atggctgttg 
301 gaatgccggt ggtgaactca gtgat 



ttgctgctca tttatctcta gggactaggt 
tggaggacga ttttaagaca gttgttggca 
ccattatgct cttnaatgtt catggatggc 
ttgtagtaac tttagcagtt ggaacaaagc 
aaattaaaga gaggcataca gtaattcaag 
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CLAIMS : 

1. An isolated polynucleotide encoding a polypeptide which 
includes the amino acid sequence shown in Figure 2 . 

2. A polynucleotide according to claim 1 wherein the coding 
sequence is the coding sequence shown in Figure 2. 

3. A polynucleotide according to claim 1 wherein the coding 
sequence is a mutant, allele, variant or derivative of the 
coding sequence shown in Figure 2, by way of addition, 
deletion, substitution and/or insertion of one or more 
nucleotides . 

4. An isolated polynucleotide which on expression in a 
transgenic plant exerts a negative regulatory effect on a 
pathogen defence response of the plant, which defence response 
is pathogen independent and autonomous of the presence of 
pathogen, the polynucleotide encoding a polypeptide which 
includes an amino acid sequence which is a mutant, allele, 
variant or derivative of the Barley Mlo sequence shown in 
Figure 2, or is a homologue of another species or a mutant, 
allele, variant or derivative thereof, the amino acid sequence 
differing from that shown in Figure 2 by way of addition, 
substitution, deletion and/or insertion of one or more amino 
acids . 
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5. A polynucleotide according to claim 4 encoding a 
polypeptide which includes the amino acid sequence shown in 
Figure 13 . 

5 6 . A polynucleotide according to claim 5 wherein the coding 
sequence is that shown in Figure 10. 

7 . A polynucleotide according to claim 5 wherein the coding 
sequence is a mutant, allele, variant or derivative of the 

10 coding sequence shown in Figure 10, by way of addition, 
deletion, substitution and/or insertion of one or more 
nucleotides. 

8 . A polynucleotide according to claim 4 encoding a 

15 polypeptide which includes the amino acid sequence shown in 
Figure 14 . 

9. A polynucleotide according to claim 8 wherein the coding 
sequence is that shown in Figure 11. 

20 

10. A polynucleotide according to claim 8 wherein the coding 
sequence is a mutant, allele, variant or derivative of the 
coding sequence shown in Figure 11, by way of addition, 
deletion, substitution and/or insertion of one or more 

25 nucleotides. 

11. A polynucleotide according to claim 4 encoding a 
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polypeptide which includes the amino acid sequence shown in 
Figure 15. 

12. A polynucleotide according to claim 11 wherein the coding 
sequence is that shown in Figure 12 . 

13 . A polynucleotide according to claim 11 wherein the coding 
sequence is a mutant, allele, variant or derivative of the 
coding sequence shown in Figure 12, by way of addition, 
deletion, substitution and/or insertion of one or more 
nucleotides . 

14. A polynucleotide according to any preceding claim operably 
linked to a regulatory sequence for expression. 

15. An isolated polynucleotide encoding a polypeptide which on 
expression in a transgenic plant produces a polypeptide which 
can stimulate or maintain a defence response of the plant, the 
encoded polypeptide including an amino acid sequence which is a 
mutant, allele, variant or derivative of the Barley Mlo 
sequence shown in Figure 2 or of a homologue of another 
species, the amino acid sequence differing from that shown in 
Figure 2 by way of addition, substitution, deletion and/or 
insertion of one or more amino acids. 

16. A polynucleotide according to claim 15 which stimulates or 
maintains said defence response of the plant on homozygous 
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expression in the plant. 

17. A polynucleotide according to claim 15 wherein the amino 
acid sequence includes an alteration identified in Table 1. 

5 

18. A polynucleotide according to claim 17 wherein the amino 
acid sequence is that of Figure 2 including a substitution at 
residue 240. 

10 19. A polynucleotide according to claim 17 wherein the amino 
acid sequence includes Leucine at residue 240. 

20. A polynucleotide according to any of claims 15 to 19 
operably linked to a regulatory sequence for expression. 

15 

21. An isolated polynucleotide which has at least about 600 
contiguous nucleotides of the nucleotide sequence of any of 
claims 1 to 13 or complement thereof 

20 22. A polynucleotide according to claim 21 operably linked to 
a regulatory sequence for transcription. 

23. An isolated polynucleotide which has at least about 300 
contiguous nucleotides of the sequence of any of claims 1 to 
25 13, or complement thereof, operably linked to a regulatory 
sequence for transcription. 
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24 . A polynucleotide according to claim 22 or claim 23 wherein 
the regulatory sequence includes an inducible promoter. 

25 . A nucleic acid vector suitable for transformation of a 
host cell and including a polynucleotide according to any 
preceding claim. 

26. A nucleic acid vector according to claim 25 wherein said 
host cell is a microbial cell. 

27. A nucleic acid vector according to claim 25 wherein said 
host cell is a plant cell. 

28. A host cell containing a heterologous polynucleotide or 
nucleic acid vector according to any preceding claim. 

29. A cell according to claim 28 which is microbial. 

30. A cell according to claim 28 which is a plant cell. 

31. A cell according to claim 30 having said heterologous 
polynucleotide incorporated within its genome. 

32. A cell according to claim 31 having more than one said 
polynucleotide per haploid genome. 

33. A cell according to any of claims 30 to 32 which is 
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comprised in a plant. 

34. A plant including a cell according to any of claims 30 to 
32 . 

5 

35. A plant which is a sexually or asexually propagated off- 
spring, clone or descendant of a plant according to claim 34, 
or any part or propagule of said plant, off- spring, clone or 
descendant . 

10 

36. A part or propagule of a plant according to claim 35. 

37. A plant according to claim 34 which does not breed true. 

15 38. A method of producing a plant, the method including 

incorporating a heterologous polynucleotide according to any of 
claims 1 to 14 into a plant ce,ll and regenerating a plant from 
said plant cell. 

20 39. A method of producing a plant, the method including 

incorporating a heterologous polynucleotide according to any of 

claims 15 to 20 into a plant cell and regenerating a plant from 
said plant cell. 

25 40. A method of producing a plant, the method including 

incorporating a heterologous polynucleotide according to any of 
claims 21 to 24 into a plant cell and regenerating a plant from 
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said plant cell. 

41. A method according to any of claims 38 to 40 including 
sexually or asexually propagating or growing off-spring or a 
descendant of said plant. 

42. A method of stimulating a defence response in a plant, the 
method including causing or allowing transcription from a 
heterologous polynucleotide according to any of claims 1 to 14 
within cells of the plant. 

43. A method of stimulating a defence response in a plant, the 
method including causing or allowing transcription from a 
heterologous polynucleotide according to any of claims 15 to 20 
within cells of the plant. 

44. A method of stimulating a defence response in a plant, the 
method including causing or allowing transcription from a 
heterologous polynucleotide according to any of claims 21 to 24 
within cells of the plant. 

45. A method of producing a polynucleotide encoding a 
polypeptide which on expression in a transgenic plant produces 
a polypeptide which can stimulate or maintain a defence 
response of the plant, the method including alteration of the 
nucleotide sequence of a polynucleotide according to any of 
claims 1 to 14 . 



WO 98/04586 



PCT/GB97/02046 



115 

46. A method according to claim 45 involving site-specific 
sequence mutation. 

47. A method according to claim 45 involving intracellular 
homologous recombination . 

48. A method wherein following alteration of a nucleotide 
sequence in accordance with the method of claim 45 a 
polynucleotide including the altered nucleotide sequence is 
introduced into a host cell. 

49. A method according to claim 48 wherein the host cell is 
plant cell. 



50. A method wherein following introduction of a 
polynucleotide into a plant cell in accordance with claim 4 9 
plant is regenerated from the cell or descendants thereof 
including the altered nucleotide sequence. 

51. Use of a polynucleotide according to any of claims 1 to 
for stimulating a defence response in a plant. 

52. Use of a polynucleotide according to any of claims 15 to 
20 for stimulating a defence response in a plant. 

53. Use of a polynucleotide according to any of claims 21 to 
24 for stimulating a defence response in a plant. 
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54 . Use of a polynucleotide according to any of claims 21 to 
24 for down-regulation of expression of a gene encoded a 
polypeptide encoded by a polynucleotide according to any of 
claims 1 to 14 . 

55. Use of a polynucleotide according to any of claims 1 to 14 
in the production of a transgenic plant. 

56 . Use of a polynucleotide according to any of claims 15 to 
20 in the production of a transgenic plant. 

57. Use of a polynucleotide according to any of claims 21 to 
24 in the production of a transgenic plant. 

58. A method of determining the presence of a pathogen 
resistance or susceptibility allele in a plant or plant cell, 
the method including analysing a sample from the plant or plant 
cell by: 

(a) comparing the sequence of nucleic acid in the sample 
with all or part of the nucleotide sequence shown in Figure 7 
to determine whether the sample from the patient contains a 
mutation ; 

(b) determining the presence in the sample of a 
polypeptide including the amino acid sequence shown in Figure 7 
or a fragment thereof and, if present, determining whether the 
polypeptide is full length, and/or is mutated, and/or is 
expressed at the normal level; 
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(c) performing DNA fingerprinting to compare the 
restriction pattern produced when a restriction enzyme cuts 
nucleic acid in the sample with the restriction pattern 
obtained from the nucleotide sequence shown in Figure 7 or from 
a known mutant, allele or variant thereof; 

(d) contacting the sample with a specific binding member 
capable of binding to nucleic acid including the nucleotide 
sequence as set out in Figure 7 or a fragment thereof, or a 
mutant, allele or variant thereof, the specific binding member 
including nucleic acid hybridisable with the sequence of Figure 
7 or a polypeptide including a binding domain with specificity 
for nucleic acid including the sequence of Figure 7 or the 
polypeptide encoded by it, or a mutated form thereof, and 
determining binding of the specific binding member ; 

(e) performing PCR involving one or more primers based on 
the nucleotide sequence shown in Figure 7 to screen the sample 
for nucleic acid including the nucleotide sequence of Figure 7 
or a mutant, allele or variant thereof. 

59. A method of determining the presence of target nucleic 
acid in a plant or plant cell, the method including contacting 
a nucleic acid molecule which includes the nucleotide sequence 
shown in Figure 7 or an oligonucleotide fragment thereof with 
nucleic acid in a sample from the plant or plant cell and 
assessing hybridisation of said nucleic acid molecule with 
nucleic acid in the sample. 
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60. A method according to claim 59 which involves 
amplification of nucleic acid to which said nucleic acid 
molecule hybridises . 

5 61. A method according to claim 59 or claim 60 wherein said 
nucleic acid molecule includes an alteration in sequence 
compared with the nucleotide sequence shown in Figure 7 or 
corresponding fragment thereof . 

10 62. A method according to claim 61 wherein said alteration is 
selected from those shown in Table 1 . 

63. An assay method for identifying a compound able to bind 
the polypeptide encoded by the polynucleotide of any of claims 

15 1 to 14 or any of claims 15 to 20, the method including: 

(a) bringing into contact said polypeptide or a fragment 
thereof, and a test compound; and 

(b) determining interaction or binding between said 
polypeptide or fragment thereof and the test compound. 

20 

64. An assay method according to claim 63 wherein a compound 
is identified which is able to bind the polypeptide for which 
the amino acid sequence is shown in Figure 2. 

25 65. An assay method for identifying a compound able to 

stimulate a defence response in a plant by interaction with the 
polypeptide encoded by the polynucleotide of any of claims 1 to 
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14 or any of claims 15 to 20, the method including: 

(a) contacting a plant or plant part with a test compound and 
determining stimulation of a defence response; and 

(b) bringing into contact said polypeptide or a fragment 
thereof with a test compound and determining interaction or 
binding between said polypeptide or a fragment thereof and the 
test compound; 

step (b) being performed with a test compound which tests 
positive in step (a) , or step (a) being performed with a test 
compound which tests positive in step (b) , or steps (a) and (b) 
being performed in parallel. 

66. An assay method according to claim 65 wherein stimulation 
of a defence response is determined by monitoring pathogen 
growth and/or viability on the plant or plant part. 

67. An assay method according to claim 65 or claim 66 wherein 
a compound is identified which is able to bind the polypeptide 
for which the amino acid sequence is shown in Figure 2 . 

68. An assay method according to any of claims 65 to 67 
wherein a compound is identified which is able to stimulate 
resistance to powdery mildew in barley. 

69. A method which includes following identification of a 
compound as being able to stimulate a defence response in a 
plant in accordance with any of claims 65 to 68 formulation of 
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the compound, or optionally if the compound is peptidyl nucleic 
acid encoding it, into a composition including at least one 
additional component . 

70 . A method which includes following identification of a 
compound as being able to stimulate a defence response in a 
plant in accordance with any of claims 56 to 58 application of 
the compound, or optionally if the compound is peptidyl nucleic 
acid encoding it, to a plant. 

71. Use of a polypeptide encoded by a polynucleotide according 
to any of claims 1 to 14, in screening for compounds able to 
stimulate a defence response in a plant. 

72 . Use of a polypeptide encoded by a polynucleotide according 
to any of claims 15 to 20, in screening for compounds able to 
stimulate a defence response in a plant. 

73 . A compound able to stimulate a defence response in a plant 
identified by a method according to any of claims 63 to 68. 
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MSDKKGVPARE'LP ETPSWAV 
ATGTCGGACAAAAAAGGGGTGCCGGCGCGGGAGCTGCCGGAGACGCCGTCGTGGGCGGTG 60 

AVVFAAMVLVSVLMEHGLHK 
GCGGTGGTCTTCGCCGCC ATGGTGCTCGTGTCCGTCCTC ATGG AAC ACGGCCTCCACAAG 120 

LGHWFQHRHKKALWEALEKM 
CTCGGCCATTGGTTCC AGC ACCGGCAC AAG AAGGCCCTGTGGG AGGCGCTGG AGAAGATG 180 

KAELMLVGFISLLLIVTQDP 
AAGGCGGAGCTCATGCTGGTGGGCTTCATATCCCTGCTCCTC ATCGTC ACGC AGG ACCCC 240 

IIAKICISEDAADVMWPCKR 
ATCATCGCCAAGATATGCATCTCCGAGGATGCCGCCGACGTCATGTGGCCCTGCAAGCGC 300 

G T E G R K P SKYVD YCPEGKVA 
GGCACCGAGGGCCGCAAGCCCAGCAAGTACGTTGACTACTGCCCGGAGGGCAAGGTGGCG 3 60 

LMSTGSLHQLHVF I F V L A V F 
CTCATGTCCACGGGCAGCTTGCACCAGCTGCACGTCTTCATCTTCGTGCTCGCGGTCTTC 420 

HVTYSVITIALSRLK^MRTWK 
CATGTCACCTACAGCGTCATCACCATAGCTCTAAGCCGTCTCAAAATGAGAACATGGAAG 4 80 

KWETETTSLEYQFANDPARF 
AAATGGGAGACAG AGACCACCTCCTTGG AATACC AGTTCGC AAATGATCCTGC ACGGTTC 5 4 0 

RFTHQTSFVKRH LGLSSTPG 
CGGTTCACGCACCAGACGTCGTTCGTGAAGCGCCACCTGGGCCTCTCCAGCACCCCTGGC 600 

IRWVVAFFRQFFRSVTKVDY 
ATCAGATGGGTGGTGGCCTTCTTCAGGCAGTTCTTCAGGTCAGTCACCAAGGTGGACTAC 660 

LTLRAGFINAHLSQNSKFDF 
CTGACCTTGAGGGCAGGCTTCATCAACGCGCATTTGTCGCAAAACAGCAAGTTCG ACTTC 7 20 

HKYIKRSMEDDFKVVV GISL 
C AC AAGT AC ATC AAG AGGTCG ATGG AGG ACG ACTTC AAGGTCGTCGTCGGCATCAGCCTC 7 80 

P L W G V A I LTLFLD I NGVGTL 
CCGCTGTGGGGTGTGGCGATCCTCACCCTCTTCCTTGACATCAATGGGGTTGGCACGCTC 8 40 

IWISFIPLVILLCVGTKLEM 
ATCTGGATTTCTTTCATCCCTCTCGTGATCCTCTTGTGTGTTGG AACCAAGCTGG AGATG 900 

I IMEMALEIQDRASVI K G A P 
ATCATCATGGAGATGGCCCTGGAGATCCAGGACCGGGCGAGCGTCATCAAGGGGGCCCCC 9 60 

V V E P S N K F F W F H R P D W V L F F 
GTGGTCGAGCCC AGC AAC AAGTTCTTCTGGTTCC ACCGCCCCG ACTGGGTCC7CTTCTTC 10 2 0 

I H L T LFQNAFQH A H F V V/ T V A 
ATAC ACCTGACGTTGTTCC AGAACGCGTTTC AGATGGCGC ATTTTGTGTGG AC AGTGGCC 10 80 

T P G L K K C Y H T Q I G I> S I M K V V 
ACGCCCGGCTTGAAGAAATGCT ACCACACGC AG ATCGGGCTGAGC ATC ATG/UvGGTGGTG 114 0 

V G LA L Q F LCSYHT FP L Y A L V . 
GTGGGGCTAGCTCTCCAGTTCCTCTGCAGCTATATGACCTTCCCCCTCT ACGCGCTCGTC 120 0 

TQMGSNMKRSIFOEQTSKAL 
ACACAG ATGGG ATCAAACATGAAGAGGTCCATCTTCG ACG AGC AG ACGTCC AAGGCGCTC 12 60 

T N W K N T A K E K K K V R 0 T D U L M 
ACC AACTGGCGG AAC ACGGCC AAGGAGAAGAAG AAAGTCCGAG AC ACGGAC ATGCTGATG 1320 

A Q M I G O A T P f> R G S S V M P 3 R G 
GCTCAG ATG ATCGGCGACGC AAC ACCG AGCCGAGGCTCGTCGCCG ATGCCGAGCCCGGGC 1380 
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Figure 2 (Continued) 



SSPVHLLH KGMGRS DDP 'QSA 
TC ATCACCCGTGC ACCTGCTTCAC AAGGGCATGGGGCGGTCGGACGACCCCCAG AGCGCG i 4 4 0 

PTSPRTQQEARDMYPVVVAH 
CCCACCTCGCC AAGG ACCC AGC AGG AGGCT AGGG AC ATGT ACCCGGTTGTGGTGGCGC AC 1500 

PVHRLNPNDRRRSASSSALE 
CCGGTGCAC AGACTAAATCCT AACGACAGG AGGAGGTCCGCCTCGTCGTCGGCCCTCG AA 1560 

ADIPSADFSFSQG* 
GCCGACATCCCCAGTGCAGATTTTTCCTTCAGCCAGGGATGA 1602 
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Figure A 
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FIGURE 5 



292 (ttttAncrrckTttrjttra ^ 341 
II : I I f ! MINI I M I I f I H I : I I I M I M Mil III III 
80 GCANAGCTGATGCTGCTGGGCTTCATNTCCCTGCTTCTCACCGTGGCACA 129 

342 f^Acr^nATCATP^y^AAff ^TATnnATnTCCffAC^ATGCCC^CGACGTCA 391 

II II 1 I I I Mll:n MINI If II II Ml II II 

130 GGCGCC. . .CATCTCCAANATCTGCATCCCCAAGTCGGCTGCCAACATCT 17 6 

392 TGTGGcr^T^AAf^yf^ArraA fyyy;r;ar: . AAftnncAGCAAGTACGT 440 

III Ml llllll Ml : II I I IMI :f Mill 

177 TGTTGCCGTGCAAGGCAGGCCNAGATGCCATCGAAGAANAAGCAGCAAGT 22 6 

441 TnArrTAtrr^nnf^AG GTGAGCAGCAGAGCCCGGACCAGCAGCTTCACGA 490 

I I : I : I II II II I I f I M I I Ml M 
227 GGTCNCCNGTCC . TTGGCCGGCGCCGGCGGCGGGGACTACTGCTCNAAAT 275 

4 91 TGATGAAGAAATCAATACC GAACTTTTTCTTGTTTTCT 528 

I MIMMI IM II : : : 

27 6 TCGATGTGAGAATAACNCCAGCTGCCGGCAAGCACAACCTCGATNCNATN 325 

529 TCTGATTGTCGTCTTGGCTTGGCTTAATTGGTGTGTGTGTGTGTGTTTGC 578 

11:1(1 | M II M I I i I I I I Ml 

326 ACTNATT TAACTATAATTGATTTTTCTTGGGTTTTCTGC 364 

579 A Ga^AA^Tf^^TnAT ^TrrArnffnr AftCTTGCACCAGCTGCACGTC 628 

MM MM Mill III Mill f III IMI III MM Iff I 
365 AGGGCAAGGTGGCGCTGATGTCGGCAAAGAGCATGCACCAGCTGCACATT 414 

629 TTCATCTTCCTCCTCra 678 

M II M I I II M II II I II MIMMI II II II M II II M I I M ^ 
415 TTCATCTTCGTGCTCGCCGTGTTCCATGTTACCTACTGCATCATCACCAT 464 

579 AnCTCTAA^raTnTrAAAGTGAGCCTTTGCTTCT . TCTTCTTCTT 723 

I I II I II II Mill MM II MM 111 II 

465 GGGTTTAGGGCGCCTCAAAGTGAGTTTGTCGTTCTGTCCCTCATGCACAT 514 

724 CTTTTACC . \ GCACGTCTGTCTGTCAGGCGTACCTACCTGTTCA 7 65 

MM I III: M M I If I I I II I 

515 GTTTTCTCTAGTTCTAGCAANATTGTCAGTCCTTCAAATGGATTGTTTCG 564 

766 TCAGGCTTGAGTAAAACTGTTCCATAATCTGC TCCGGCATAA 807 

M M M I M MM MI I MM 

565 ACA AGAAACCCAATTTATTAATTTGCCAGTTAAATATATAATAA 608 

80S TCCTCTCCTCCTG rAftATttAftAArATOGAAGAAATGGGAGACAGAG 853 

I MM I M M I I M 11 M II I II M I I M II 

609 TTGATCTTTCTTGGTTTTAGATGAAGAAATGGAAGAAGTGGGAGTCACAG 658 

854 AHr:A(^TC(^T(^AATAC(^AC?TTC(y!AAATG GTCAGGATCCCCCACTCTG 903 

II II M I II I II II II I M II I I I Ml I II 

659 ACC AACTCATTGG AGT ATCAGTTCGC AATCGGT AG TG . AATTAA 701 

904 CAATCTCCC I . . CTTCTTCGAAACCAAACC TG ATG ATCCATTT AAA 94 6 

I I I I 11 I I M III I MM MUM Mill 
702 GAATCTCCCTAACTATTTCATTTCAGAACCTTTATGATAATGTCTTGAAA 751 

947 GACGCAGGCACGATCAGAGTGAGTGAACTGATGTATGTTCATTTTTTGTG 9 96 

Ml I If I M IM 1 IMI I 
752 GAGGAGGAGCAAATCAG . CTGAAAAATATG ATCGA 785 

997 TCCTTTCAG ATCrrTOHArr^TnCGGTTCACGCACGAGACGTCGTTCGTr; 104 6 

Ml I MIMMI MM III llllll MM II II (Mill Mill 
786 TCCATGCAGATCCTTCACGATTCAGGTTCACGCATCAGACGTCGTTCGTG 835 
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1047 AAGCGCCACCTCZGG rcTCTCC AttAttttTC^A TClGATMnTGR T 1093 

iiiif 1 1 1 1 1 1 1 tin i M 1 1 1 1 r 1 1 1 1 1 1 1 ii 1 1 i it 

836 AAGCGGCATCTGGGATCATTCTCAAGCACCCCTGGGCTCAGATGGATCGT 885 

1094 GAGTTTTTTAGCtTCTTATCTGCCCCTCATCTGTGTGTAATGTT 1137 

INN I INI N I I I INI IN 

886 GAGTTATCAATCTCCGAAT ACATGCTTGTTTTTTATTCTTGCA 928 

1138 . . TGGCGTA I TGGAGTCAGGTGATTT . ACCTT 1165 

UN If I I III IN I I 

929 ACTGGCCTAGCTGTTCCAATTCAATCCATATTTTTTGAAAAAAAAAATAT 978 

1166 GCCTGTGATGTTTGTTGCCTTGTCAG^Tffl^CTTnTTrAf^r.ACTTrTTr i 2 15 
IN I M I I I fill (III II llflNKIIIIKIK 

97 9 TCATGCCGTGTTTG TTGTTAGGTAGCATTCTTCAGGCAGTTCTTT 1023 

1216 AGGTCAGTCACCAAGGT^ArTAnr^aAnCTTOAr ^CA^CTTr.ATnAA 1265 

Nil N I II N I II II I (I II N II N I II 11 N N I I I I N N 
1024 GGGTCCGTCACCAAGGTGGACTACCTGACCATGCGGCAAGGCTTCATCAA 107 3 

1266 £GTACGTGC CTCCCCTTCTAGCTCCGCCATTGCTGCCGCGATGTAG 1311 

M I I f I III II Hill II III || 

1074 TGTATATACTAATC AAACCTGACC AATTCAACATTGATGATGC . AAACAG 1122 

1312 CAGCAAAGCTTCT CAAGTTATCCTTCTGACGCTAAAGTTCCCA 1354 

N I I I I I ! || M I I I I II I I I I I 
1123 AAGACCAGGTTTTTTTTTTCCGAGTTGTGCAT . TGAAGTTAATG 1165 

1355 TGTTTTTTCCTCAAATTATTCTGCGCA GGCG . CATTTarr.Cin A A^ ft fl My; i 40 3 

NIN I II III I I I I I I If N N II N I II II IN 
1166 .GTTTTAGCTTC. . .TTCTCTTTTGCAGGCGCCATTTGTCGCAGAATAGC 1211 

1404 AAGTTCfiACTTCCACAAGTArrATrAArtAGGTnGAT^ AnaAraArTTraft 1453 

M I 11 I I I I ( II I I I II IIMKIMIIKI I I I I I I ! I I I I I 1 I I I 
1212 AAGTTCGACTTCCACAAATACATCAAGAGGTCTTTGGAGGACGACTTCAA. 1261 

K54 GGYCGTCGTCrK^ATQAfiGTACGTTCCATTXX^TCCTCTGCACCAGACCA 1503 

N INN I N II I N II M INI II I 
1262, AGTTGTCGTTGGCATCAGGTCCG TCCTCGCTTT 1294 

1504 CACCCCATGGATAGATTTTAACAATTGCTGTCAGGTTCCACATGATAACA 1553 

I I INN I II I I III INI I 

1295 . ... ATTAATTATAGGA. . . . CTCTTATATTCAACATTTTTTTT 1330 

1554 ATATACTATGA . ACTTGGTCTTTGCTCCTTGTCCTTG . . ! . . CACGATCA 1597 

Nil I I I N INI f INI II fill 
1331 ATAAAGAAACATATTTAGTCT . . . CCAGTTGTGTATGTGT ATGTGG ATCT 1377 

1598 TGACACATTTCGCCTGTTTTCGCAGCCTf^ i €47 

MMIMNIII Ml Ml MINIUM MINI III Ml 
1378 TGACACATTTGG . CTGGTTTTGCAGCCTCCCTCTGTGGTTCGTCGGAATC 1426 

1648 CTC ACCCTHTTTy^T T(Z A f! A Tf! A A TfSfyr A TCZG xmrTwnn TCTCCGGTT T 1697 

JLL N N N II II III I I I 1 1 Mil (Ml Ml 
1427 CTTGTACTCTTCCTCGATATCCACGGTA . . ATCCTTGTCCT ATTT 1469 

1698 CTCTATTGCTTTGCAGCTAAATAAAACACTTGCAATTCGTCTCGTGATCA 1747 

III' IN N N I I Nil MM I INI 

1470 CATTCTTTTTTTTACTCTCAAAACCTTGTTCTGAATTGGtCTTATAATCA 1519 

1748 CCGCTCATTTTTCAACCATTTCTTTTTCTACTC^TAG GGGTTaanArfy-T 1797 

1C->A 1 ' I I I I N I I N I II I I III I II I N II I I 

1520 CCATCG ATTTTTTTTC AACTT . TTTCCCCGCGTGTAGGTCTTGGCACACT 1 S 6 8 

17 9* CATCTfifiATTTCTTTCATCCCTCTCGTGGTAAGTGC . AGATTTCTCC . AT 1845 

N Mill I M M I I II M I I Mill M I INN II I 
1569 TATTTGGATCTCTTTTGTTCCTCTCATCGTAAGAGCGAAATTTCCCCTGT *1618 
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1846 CGAAAGCAACAGCAAACCCAATT „ . TGATCGCAAT 1878 

( If II (Nil NIMH Iff Ml 

1619 CCAAAGAAACAGTTAACATAATTAATTATGCTTTAATTTATCATGAAAAT 1*68 

1879 GG AAACCCACACCTAATATTAACTCAAAATGTCAATTGTCGGTGCGTCTT 1928 

f I If ((I MM l(( Ml Ml ( 

1669 TAATATGATCATATAACT AATG AACAAACATTCA , . TGTGAATGCCACCG 1716 

1929 CCTCAAt^G ArcrTCTTG TnTftT^ 1978 

(((in mum ((((( iiiiiifi (((in i ((((( 

1717 TTGTCTCAGATCGTCTTGTTAGTTGGGACCAAGCTAGAGATGGTGATCAT 1766 
1979 GGAGATGCraW^AGATHPAraArC^ 2028 

llim KIM (MM ((MM MM I M Ml MM M f 
17 67 GGAGATGGCCCAAGAGATACAGGACAGGGCCACTGTGATCCAGGGAGCAC 1816 

2029 CCGTf^Tnn AGCCnAGnAA^AAGTTCTTCTGGTTCCACCGCCCCGACTGG 2078 

1 MM M M IMIIMIM 1MMMIM MIIMI MUM 
1817 CTATGGTTGAACCAAGCAACAAGTACTTCTGGTTCAACCGCCCTGACTGG 1866 

2079 GTCCTCTTCTTHATACArcTGAHGTTGTT 2107 

M I I M M M M t M M M 11 I M 
1867 GTCTTGTTCTTC AT AC ACCTG ACACTCTTCCC ATGTAC ATGTTTAAAACC 1916 



2108 f^AGAAfra .GTTTCAGATGGCGCATTTTG 2136 

MMIMM I I I ( I M I It I I II f I I 1 
2017 GACGGACGGATCGATCATCACCAGAACGCATTTTCAGATGGCGCATTTCG 2066 

2137 TGTGGArAGTrc GTACGCCAC ! . . . CGATGAACTTGTCAGTT 2173 

I (Mil M I It M I I 111 M I II M I 

2067 TATGGACTATGGTGTGTATGCTACTTGCTTAGTTGTTGCCATTATCAGTT 2116 

2174 AACATGGGTGTCA. . . AGGCACCG AGTGCCGCTG ATGA ! 2208 

M I Mill 1 MM II M MM 

2X17 CTTAAGCAAATTAAGTGTGATGCATGCACTGA CTAATGAGACAA 2160 

2209 .ACTGCTCTGACGGAGATTTACTTGTGTTGT* ! AGGCC 2243 

II I M I lilt I M I I I M ( M I 

2161 AAA&TGACACAGCTTGTTCATCGATCTGGTTGTTTTGTGTGTGACAGGCA 2210 

2244 Arayrara CTTGAAGAAAT ^ 2293 

(I M II MIIMIMIM III III 1 1 1 1 1 I I I I 1 I I 

2211 ACACCTGGTCTG AAG AAATGCTTCCATG AAAATATTTGGCTG AGCATCGT 2260 

2294 GAAcrcrnCTorrrc^^ 2343 

I I II I M I I M I I I II I II I M I M II I II I I I I M 
2261 GGAAGTCATTGTGGGGATCTCTCTTCAGGTGCTATGCAGCTACATCACCT 2310 

2344 Trcrarrrr ^Ar^GCT 2389 

MM M I M II II M M M I I M M II I M M I M M 
2311 TCCCGCTCTACGCGCTCGTCACACAGGTGAACAAGCCATTCACAAA 2356 
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FIGURE 6 



295 GAGCTCATGCTf^TnGGrTTCATATCnrTfirrTrrTr ATCGTCArnrAanA 344 
M I I I I : I I 1 ! i I I I I I I [ I I I I I I [ I I I I I I I I I I M I I u I I I f I I I I 
1 GAGCTCNTGCTGGTGGGCTTCATATCCCTGCTCCTCATCGTCACGCAGGA 50 

345 CCCCAT CflTCGCCAAGATATGCATCTCCGA^ATCCCfircaArr:TrATr:T 39 4 
I I I N 111 III I I I I I j I I I I I I I I I I I IN: : I I I 
51 TCC . . . CGTCTCC AGG ATCTGC ATCTCCAAGGAGGCCGGCG AN AANATGC 97 

395 GGCCCTfiCft AG. . . . . CGCGGCACCGAC^ncr^AA^crr.A . 430 
I I I I I I I I : ( M | | || || Mil || 

98 TCCCGTGCAAGCCTTACKACGGCGCCGGCGGTGGCAAAGGCAATGACAAT 147 

431 G C kAGTACGT TGAcrAcrnr.rr.ank 455 

Ml: M : I II III 1 
148 C ACCGGAGGCTTCTCTGGCTCC AAGGCG AN AGCG AN ACCC ACCGCCGGTT 197 

4 56 GGTGAGCAGCAG AGCCCGGACCAG " 47 9 

MUM 111111:1 
198 CCTG . GCTGCCCCGGCCGG ANTGG ACGTCTGCGCC AAAC AGGTG AGC ACC 2 4 6 

480 CAGCTTCACGATGATGAAGAAA . TCAATACCGAACTTTTTCTTGTTTTCT 52 8 

1-111:11 f I I i I I : I I Mil ill f I 
2 47 TANCGTCNCCACAAACCACAAACTANCTAATGAGCATGGACCTGAATTTC 29 6 

52 9 TCTGATTGTCGTCTTGGCTTGGCTTAATTGGTGTGTGTGTGTGTGTTTGC 57 8 

I I I I I M M I II I I M II I II I I fill 
297 TTCTCTTCTTGGCTTGGCTTGACTAAATTGGT TGTGC 333 

579 AGGGCAA GGTGGCGCTCATGTCCACGr^CAGCTTGCArrAarTnrAr^r 628 

f M M II I I II II M I I I (I :: I If I Ml I M I I M I I II II I 
334 ACGGCAAGGTGGCGCTGATGTCNNCGGGAANCATGCACCAACTGCACATA 383 

629 TTCATCTTCGTGCTCGCGGTCTTCCATGTCArinTAr AGCGTnATnArrAT 678 

M M M I II M I M I I I I I I M I I I Ml M II II M I II II II I 
38 4 TTCATCTTCGTGCTCGCCGTCTTCCACGTCTTGTACAGCGTCGTCACCAT 433 

€7 9 AGCTCTAAGCCGTrTHAAAnTr:A^rrTTTr:rTTrTTnT^T^r'T TCTTT 7 12 Q 

I M I II II II I M I M II I II I I II 
434 GACCCTAAGCCGTCTCAAAGTGAGCATCATACTC 4 67 

729 ACCGCACGTCTGTCTGTCAGGCGTACCTACCTGTTCATCAGGCTTGAGTA 77 8 

M I I I I I M III Ml II I 

468 GAGCTGTTTGTCAAT AATCCTT . . . GGTTTCCAATCCAATTCCA 508 

779 AAACTGTTCC AT AATCTGCTCCGGC AT AATCCTCTCCTCCTGC AGAIGAG 828 

11 '11 1 1 MM Mill I I MM II Mill 

509 AAGCTGGCACTGATCCTGCTCCGG CTTCCTGCAGATGAA 547 

829 AACATGGAA GAAATGGGA G A C AGAG ACCACCTrrTTGGAATArrA^TT^ r: 878 

M II I M II I II II I I II I II I I I I I MM M M M I M 
54 8 GCAATGGAAGAAGTGGGAGTCGGAGACCGCCTCGCTGGAGTATCAGTTCG 5 97 

87 9 CA&AIGGTC AGG ATCCCCC ACTCTGC AATCTCCCCTTCTTCG AAACC AAA 928 

I MM! MM If I l Ml [ Ml 

5 98 CGAATGGTCAG CTTCAACTTTTCTTACTGAAA 62 9 

92 9 CCTGATGATCCATTT . . . AAAGACGCAGGCACGATCA . . . . GAGTGAGT 97 0 

M M M (MM I I M I I I I M M M M I I || 

630 CCGGATG . . . CATTTACAACAAACGCACGCACGATCAATCATCACAGTGT 67 6 

971 GAACTGAT . GT ATGTTC ATTTTTTGTGTCCT . TTCAG ATCC. _ TOP Amp 1016 

M I Ml M i I I MM I I I! M I I Ml 

67 7 GAGCCGATACGTTGAACCCGATTGAAATCCTCCGCAGATCCCATCGCCGG 72 6 
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1017 TTCC!C^TTCftr^ ArCAfiA rnTrnTT - CGTGAAGCGCr.ACCTGGGCCTCT 1065 

1 | ( | | | | M I II I I II I \ i I I I I I Mil I I I M I I I I I I I I I I I 

727 TGCCGGTTCACGCACCAGACGACGTTGGGTGAGGCGGCACCTGGGCCTCT 77 6 

1066 CCA^Arrnc rT^rATrAnATnGGTCiGTGAGTTTTTTAGCTTCTTATCTG 1115 

I I 1 { I I I 1 M III I I I I I M I I I 

777 CCAGCACCCCCGGCGTCAGATGGGT 801 



1166 r^rTr:Tr:&Tr:TTTr:TTr:rrTTr;TrAf; f;T(y;r:CTTCTTCAGGCAGTTCTTC 1215 

I I II I I I I I I 11 I I M I I I I II M I 
802 GGTGGCCTTCTTCAGGCAGTTCTTC 826 

1216 AGGTCACTC ArPAA^TGaArTArCTGACCTTGAGGGCAGGCTTCATCAA 1265 

I III II II I I I I I I I I I II I I I I I I I I I I I M MINIUM! 
827 ACGTCGGTGACCAAGGTGGACTACCTGACCTTGCGGCAGGGCTTCATCAA 87 6 

12 66 CGT ACGTGCCTCCCCTTCTAGCTCCGCCATTGCTGCCGCG ATGT AGCAGC 1315 
I 

877 C 877 



1366 rA&ATTATTrT^rr^Af: nr:(;rATTTnTC:GCAAAACAGCMGTTCGACTTC 1415 

I I II II I IMM II 1 I M II M II I I 
878 GCGC ATCTCTCGC AGGGC AAGAGGTTCG ACTTC 910 

1416 CACAAGTArATCAAGAf^THnATG^Ar^ACGACTTCAAGGTCGTCGTCGG 1465* 
I I II II I N M M II M I II I 1 II I II I I M M I I I I I II M M I I I 
911 CACAAGT AC ATCAAGAGGTCGTTGGAGGACG ACTTC AAAGTCGTCGTCCG 960 

1466 CA1CAGGTACGTTCCATTCCTTCCTCTGCAC CACACCACAC 1506 

MMIMi III I 11 I M 11 t MINIM II I Ml 

961 C ATC AGGT ACGCGCC ATTCCTTTCTCTGC AC AAATT AAT AC ATCC ACCAC 1010 

1507 CCCATGGATAGATTTTAACAATTGCTGTCAGGTTCCACATGATAACAATA 1556 

I I II : I M M II N : 1:111 
1011 C ACAT ANGT AG AT AG AT AGA . TCG AT ANAT AN ATT A 1045 

1557 TACTATGAACTTGGTCTTTGCTCCTTGTCCTTGCACGATCATGACACATT 1606 

Mill I I I 1 I 1 I M I 1 MM I Mlllll 
104 6 TAC . AAGTGCCGGT ACGTACGT ACGTCTC AT . . . ATG ATCTTG AC AC ATC 1091 

1607 TO^CTf;TTTTraCA( y.CTCCCGCT r;TaGGGTGTGGCGATCCTCACCCTC 1656 

MINN Mil Ml I M I I I I M M I I I I I I I MM 
1092 TGTCCTCTTGCCGCAATCTCAAGCTCTGGTTCGTGGCGGTCCTCATCCTC 1141 

1657 TTCCTTftACATCAATG GTATGGACCTTCTCC . TCTCCGGTTTCTCTATTG 17 05 

II It I II I II I INI II M I II I Mil I I 

1142 TTCCTTGATTTCGACGGTAGCCGCCTTGTCCATGCCCTGCTCGCCCTCTC 1191 

1706 CTTTGCAGCTAAATAAAACACTTGCAATTCGTCTCGTGATCACCGCTCAT 17 55 

II I 1 II IN 1 I I I I I N I I I M 
1192 CTCCGCTTCTCTCCATAATTTGTG . AACTTGTCCCGT AT 1229 

17 S 6 TTTTr a anr ATTTrTTTTTrTArTrATAfyy;n TTGGCACGCTCATCTGG 1804 

I 1 I I II III III Mill I Mill II Mill 

1230 ATAACCACACCACCGTCGTCTTCTCGCAGGGGATCGGCACTCTTCTCTGG 127 9 

1805 ATTTCTTTrATnrrrTCTCGTC^G TAAGTGCAGATTTCTCCATCGAAAGCAA 18 54 

UN 1 I I II I I Mil I I I II I t I IN (I I 

12 80 ATGTCCGTGGTTCCTCTCGTGGTAAGTCCA C AATTTG AAT AG A 1322 

1855 CAGCAAACCCAATTTGATCGCAATGGAAACCCACACCTAATATTAACTCA 1904 

III NUN lilt III II IN II 

132 3 CAACCTGTCCAATTGTGATGTACAGTACCTCCAAACTTAA TTA 1365 
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1905 AAATGTCAATTGTCGGTGCGTCTTCC TCAACAG ATCCTCTTGTGT 194 9 

I I I II I f Ml II Mill I M M I I 1 I I M I I I I 

1366 AC ATGTCATTTGCTGAT . . GTCTTGCGTGTAACATTAGATCCTCTTGTGG 1413 

1950 GTTGGAACCAA^TraAGATfiATCAT^ ^ 1999 

I II I I I I M I I M I II I I f I I I II I I I I I I II I I II I Ml:lllll 
1414 GTTGGGACC AAGCTGGAGATGGTG ATCATGGAG ATGGCCC AGGAN ATCC A 14 63 

2000 GGACfy^HGAf^GTrATCA^ 2049 

II I I I I I 11111(1 II I I II I I I INI I I f I I II I I I I I I I I I 
1464 TG ACCGGGAG AGCGTCGTC AAGGGTGCTCCCGCCGTCG AGCCCAGC AAC A 1513 

205C ACTTrTTrTGnTTrrAcr:c^rccnAnTacy;Trr:TrTTCTTCATArAcr:T^ 2099 
III II l I l l I I I l I l I I N I I I I I I I I I I I M l l U I 1 l t 1 1 I 

1514 AGT ACTTCTGGTTCAACCGGCCTG ACTGGGTCCTCTTCCTC ATGC ACCTC 1563 

2100 ACGTTGTTCCAaAArnrCTTTnAnATGnrr^ATTTT nTnTnGArAaT^T 2149 

II I I I I II M I I I II I I I I I II I I I I Mill I I I I I I I I I I I M I 
1564 ACACTCTTCCAGAACGCGTTTCAGATGGCTCATTTCGTGTGGACAGTGGT 1613 

2150 ACGCCACCGATGAACTTGTCAGTTAACATGGG 2181 

I I : I I I I I I I I II I I I I I : I 
1614 A . . . CNTACAAGTACTTGTCACTTC ACTTANGCTAACTCC AACAAACGAA 1660 



2182 TGTCAAGGCACCGAGTGCCGCTGATGAACTGCTCTGACGGAG 2223 

I I I I I I I I I I 1 I III 111 

1711 GACACAAAACTCAATCCAACGCGCGGTAGCAAACGAACGTTTTTCCGTAC 1760 

2224 ATTTAC ] TTG 2232 

111 I I I I 

17 61 GTTTTCGTCCGCTTTCGCCCCATCCCAGCCCAAATTCGTTGACGTTGTTG 1810 

2233 TGTTGTA GGCCAnC^CCG^TTGAAGAAATGCTACCACACGCAGATCGGC: 2282 

I I I I I I I I I M I I I I I I I I II I I I I II II I II I I I I II I 
1811 CATCGCAGGCCACGCCCGGCTTGAAGAAATGCTACCACGAGAAAATGGCA 1860 

2283 rTGAC^AT CATGAAGGTGGTGGTGGGGCT^ 2332 

I I I I I I I I I I I I I III Mill II II . II I 1 II I Mill 
1861 ATGAGCATCGCCAAGGTCGTGCTGGGGGTAGCCGCCCAGATCTTGTGCAG 1910 

2333 CTATATGACCTTCCCrC TCT^ 2 382 

: M II I M I M 1 I M : II I M II I I M It I 
1911 NT AC ATC ACCTTCCCGCTNTACGCGCTCGTC AC 1943 



24 33 AATCATCTGTGTGTGCTGGCTTTGTATGCAG ATnGGATrAAArATGAAGA 2 4 82 

I II I I I I M III I I II [ I I I I 

1944 GCAGATGGGCTCACACATGAAGA 1966 

2483 GGTCCATCTTCGACGAGCAGACGTCCAAGGC . GCTCACCAACTGGCGGAA 2531 

I II : M I I I II II II It II I I I I I M I Ml M I II I It M I It 

1967 GAAGCANCTTCGACGAGCAGACGGCCAAGGCGGCTGACCAACTGGCGAAA 2016 

2532 rACGGCCAAGGAGAAGA AGAAAGTCCGAGACArGGACATGCTGATGGrTr 2581 

1 I M I I I I I M II M M II I I M II I I III I M II I I II I I I I 

2017 GATGGCCAAGGAGAAGAAGAAGGCCCGAGACGCGGCCATGCTGATGGCGC 2066 

2582 AGATGATCGGCGArGrAACACCGAGCCGAGGCTCGTCGCCGATGCrGAGr 2631 

Mill Mill Ml M It I II I I M I : M II I M 

2067 AG ATGGGCGGCGGCGCG ACGCCGAGCGTCGGCTNGTCGCCG 2107 
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2632 CGGGGCTC A *tr Anr.CGTGC. ACCTGCTTC AC AAGGGCATGGGGCGGTCGG A 2SSI 

II M I I I 1 M 1 I I I I 1 I I I II MINI f I 
2108 GTGCACCTGCTCCACAAGGCCGGGGCGCGGTCCGA 2142 

2682 CGACrrr.nAGAttGCGCC r.kcw^ 2731 

1 I I I I M I I I i I I I I ! I 1 f I I I I I I I 1 ! I I 1 I I I II 
2143 CGACCCCCAGAGCGTGCCGGCGTCCCCGAGGGCCGAGAAGGAAGGCGGCG 2192 

2732 ACATGTArCCf^TTGTC^T^GnGCACCC aOTnrACAGArTAAATnCTAAr 2781 

1 Ml III (Iff f I ( ( ft I I I I 

2193 GC GTGC AGC ATCCGGCGCGC AAGGT ACCTCCT TGT 2227 

2782 GACA^AGGAGGTCCGCCTrnTCGTCfinCCCTroAAGCCGftCATCCCCftrr 2831 

111 II I I I I I I 1 I I I I I 1 I I I I I I I I I I I I M MUMM I 
2228 GACGGGTGGAGGTCGGCCTCGTCGCCGGCGCTCGACGCTChCATCCCCGG 2211 

2832 TGCAGATTTTTCCTTCAGC I CAGGGATGAGACAAGTTTCTG 2871 

( I I I I i 1 I I I lit II If I lit II Mil Ml 11 

2278 TGCAGATTTTGGCTTCAGCACGCAACGTTGACCGATCAGACAAGTTCCTT 2327 

2872 TATT 2875 
f I f 

2328 TTTT 2331 
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helix II 



helix 111 



helix IV 
helix V 



GG CTGCTCCCC CAG C A AACC AC AC ACACAGCAGC GT ACCTGCCT 

ACGTAGCGTGCGCTTTCTTTTTTTTCCTTTCGCCTCTCTTGCTTGCTCCGGCCGGCCACG 

TCG ATAG CCCG CC ACG G C C AGG C ACCTCC C GG TTCCG TCGCCTGCATCTGCG TGTGCGTA 

CCTGCTAGAGCCCCCCGTCTCCTTGCTCCCGGCAAGGAAGCAGCTTGCCGCGGrCGACCG 

MS DXXGVPARELPETPS jW^&XMM 20 
ATGTCGG AC AAAA AAG GGGTGCCGGCGCGG G AGC TGCCG G AGACGC CC TCGTG GGCGGT G 60 



|A : >V-- V . r.A ■ A MVLVSVLMl E H G L H K 40 

GCGCTGGTCTTCGCCGCCATGC TGC TCGTG TCCGTCCTCATGGAACACGGCCTCCACAAG 120 

LGHWrQHRHKKALWEALZKK 60 

CTCCCCCATTGGTTCCAGCACCGGCACAAG AAGGCCCTGTGGGAGGCCCTGGAGAAGATG 1 80 

A 

K A E 1L-VH-- L • V - G F- I S . L I, . L I *V'&T?ZQWO)iS&[ 60 

AAGGCGG AGCTCATG CTCCTGGCCTTCATATCCCTGCTCCTCATCGTCACGCAGGACCCC Z 4Q 



1 1 '^IfVA:. KKj ' C - I- S | EDAADVMW PCKR 100 

ATCATCGCCAAGATATGCATCTCCGAGGATGCCGCCGACGTCATGTGGCCCTGCAAGCGC 300 

GT E G R X P S KYVDYCPEG X V A 120 

GGCACCGAGGGCCGCAAGCCCACCAAGTACGTTGACTACTG CCCGGAGGGCAAGGTGGCG 360 



LMSTGSLHQtn | V • F<- I' f F - V -rj .f .- A *?>VAHT ] 140 
CTCA TGTCCACGGGCAGCTTGCACC AG CTGC^CGTCTTCATCT TCGTG CTCGCGGTCTTC 4 20 



1 H V T T S V I T I A L l SRLKMRTWK 160 

CATGTCACCTACAGCGTCATCACCATAGCTCTAAGCCGTCTCAAAATGAGAACATGGAAG 4 90 

K V ETETTS U E T QFANDPARF 190 

AAATGGGAGACAGAGACCACCTCCTTGGAATACCAGTTCGCAAATGATCCTGCACGGTTC 540 

RTTHQTS FVXRHLG LSSTPG 200 

CGGTTCACGCACCAGACGTCCTTCGTGAAGCGCCACCTGGGCCTCTCCAGCACCCCTGGC 600 

I RWVVAF FRQFFRS V T X V D T 220 

ATCAGATGGGTGGTGGCCTTCTTCAGGCAGTTCTTCAGGTCAGTCACCAAGGTCG ACTAC 660 

LTLRAGFlNAHLSONSXFDr 2 40 

CTGACCTTG AGGGCAGGCTTCATCAACGCGCATTTGTCGCAAAACAGCAAGTTCGACTTC 7 20 



HXYlXRSMEDDFX f V V V G ■ I .:'S"W-L| 260 
CACAAGTACATCAAGAGGTCGATGGAGGACGACTTCAAGGTCGTCGTCGGCATCAGCCTC 7 60 



IP L W G V A . T L T L F L I D I N G V G |T-UL| 280 
CCGCTGTGGGGTGTGGCGATCCTCACCCTCTTCCTTGACATCAATGGGGTTCGCACGCTC 6 40 



jGj T X L E K 300 



ATCTGG ATTTCTTTCATCCCTCTCCTG ATCCTCTTGTGTGTTGGAACCAAGCTGG AG ATG 9 00 

I IMEHALEIQDRASVIXGAP 320 

ATCATCATGGAGATGGCCCTGGAGATCCAGGACCGGGCGAGCGTCATCAACGGGGCCCCC 9 60 

V V E p S ' " N KFFWFHRPOWVLFF 340 

GTGGTCGAGCCCAGCAACAAGTTCTTCTGGTTCCACCGCCCCGACTGGGTCCTCTTCTTC 10 20 

I H LTLFQNAFQMAKFVVTVA 360 

ATAC ACCTG ACGTTGTTCCAGAACGCGTTTCAGATGGCGCATTTTGTGTGGACAGTGGCC 1 0 80 



IPG LKXCYHTQIC LSIMK I V->*V| 360 

ACGCCCGGCTTGAAGAAATGCTACCACACGCAGATCGGGCTCAGCATCATGAAGGTGGTG 1140 

heliX VI | V - G • -L * ■ A ft Q F I C S Y M T F P L T -AV-. LStfy] 400 

CTGGGGCTAGCTCTCCAGTTCCTCTGCAGCTATATGACCTTCCCCCTCTACGCGCTCGTC 1200 

[tJQMGSNMXRSIP'DEQTSKAL 420 

ACACAG ATGGCATCAAACATGAAGAGGTCCATCTTCCACGAGCAG ACGTCCA AGGCGCTC 1 2 60 



TNWRNTAKB )X X X V R] D T 0 H L M 440 

ACCAACTGGCG G A AC A CGG CCA AGG AG AAG AAGAAAG TC CG AG ACACG GACATCCTGATG 1320 

A Q M 1 GOAT PS RGSS PHPS RG 460 

GCTCAGATGATCCGCG ACGCAACACCGAGCCGAGGCTCGTCGCCGATGCCGAGCCGGGGC 1360 

SSPVULLKXGMGRSDOPOSA 460 

TCATCACCCCTCCACCTGCTTCACAAGGGCATGGGGCCGTCGGACGACCCCCAGAGCGCG 14 40 

PTS PRTQQEARDMYPVVVAH 500 

CCCACCTCGCCAAGGACCCAGCAGGAGCCTAGGGACATGTACCCCCTTCTGGTGGCGCAC 1 500 

PVHRLNPNORRRSASSSALE 520 

CCGGTGCACAG ACTA A ATCCTAACG ACAGGACCAGGTCCCCCTCGTCG TCGGCCCTCGAA 1560 

A O I PSADr5FC9C* 
GCCG AC ATCCCCACTCCACATTTTTCCTTCAGCCAGCGATC AC ACAAGTTTCTGTATTCA 

TGTTACTCCCfcATCTATAGCCAACATAGGkTr.TCATGATTCGTACAATJSsACAAATACAAT 
I I 

TTTTTACTC ACTC 
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Figure 8 

1 GAATTCAATT AAGGACAACA ACGGATGATA GGCTTAAGCT AGAGAGGATT 
51 CATATCGATT AATTAACTGT ACTTAAGTTG AGGTAAAACT CTATCGATTG 
101 CTTTGGACAC CGGCTCTCCC ATGATCTGCC AAGTTGAGCC GGCCTACCTA 
1S1 ATTTTCTTCG AAAGCACACA ACAAACGAAG GTAACCACTA ATCTAGACAC 
201 CACGCCTAAG TT ATCAATT A CTACTCTAGT CTCGCGTAGA AACTTCATTC 
251 TTTATGGAGA GTGCTAGTAC T AG AG TACTT AATA.TAATAG TAAGCGACAA 
3 01 ACCCACGACG ATG AG AATGT ACCTCACTTA CGTAGTCAAT TAAGTCGAAA 
351 AGGAAATCTT GAACACTTAC TTTATTAAAG AAGTATTCCC CGAGGTACAG 
401 GAGAGGAGAG CACGCCAATA ACTCCAGCAC TCCTCCGAAA CCTTTCTCAC 
451 TCTCTACCGT TTTTCTCCAC ACAACTAAAA TGATGTCTAA TGTATGAAAG 
501 TGAGTTGTAC TCTATTTTGT TGTGTGTTTG GAAGTGAAAT TAGCTCATCC 
551 TTTTATAGCA ACTTAATGCT CGGTTGTAGG TTGG T AA T T A AGTCGGTAAA 
6 01 CACTCACAAC CACCATCGTC AACCAATAGG AG ATCGC C AC ATG ATCG AAA. 

6 51 GCTGACAGTT AGGGGTGCCA AC CCTGTTTT GTCCGAACC A AGCAAACAAC 

7 01 CTCTATCTAG GACCTCTCTT CTATGTCTGA CAAGTCGGCC CATATGGCGG 
7 51 TGCACTATGG ATTAAGTCAA TTTCAGTCG? TTTGGACTGT CATGTGGGCC 
801 CTTCCAATCC TTGTGCTCCC ATA'TGATTGG TCGAAAGTAC ATTTAATTCC 
851 TGGGTGAGTG CTAGAACTAA TATGATAGAT GTGCTCCGGC TCCTGGGAAA 
901 GAGGCCACTT GACATACTTG GGGTACTGCC CCAAGGGTAT TCCCT ATCGC 
951 TTTTTCATAA TTTTCTCTCT CCAAAATCGG ACGGAAACAA TAAAAAAGAG 

1001 AGGCGATGTT CATCGGCAAA TATCTATTTT TTTGATA.GTG TCTTCCCTTA 
1051 AAACTTGATT TTTGCGAAGA CTTCCGGCTA AAACCATGAA ATCAGAGTTC 
1101 CTTGTAACAA ATTTAATTTG CC7AAATACA AAAAAGATCG AATGGAGATA 
1151 GCATTAAACT TGCTCCATAC GAATCATATT AGTTGGACCG 7AACTCATAG 
12 01 AAAAAGTTGC AAGTTGGTTG ACCTATCAAC CCTCTTATGT TGACCGTAAA 

12 51 CCTGTTATGC ATTAAGGATT AAGTACCGGC AGATCGTCAC TACTCACGAA 
1301 TGCACAAATT TCCGGTAACG TAGGATGGGA TGAGTTGGTC ACAAACGGGT 

13 51 CACCACGTCG CCCAACCTGC CGCGATCGAG CCATTGGCCC GCGATGCACG 
1401 CGCTTTGACA CAGCCGCCCG CCGCCCCCCG GCCCGCCCCC G TTTTT AAT A 
1451 AAAAC CGGC C GCCCCCTGTC AAAGGTCTCA AAGTGTCAAG TGCATCAGAG 
1501 CTAAGCTAGC GGTCACCCAG TCAGCTCACC CCGAGACGCA CCAGGGGATC 
1551 TATCGGATCA TGGCAGGTGG GAGATCGGGA TCGCGGGAG'f TGCCGGAGAC 
1601 GCCGACGTGG GCGGTGGCCG TCGTCTGCGC CGTCCTCGTG CTCCTCTCCG 
1651 CCGCCATGGA GCACCGCCTC CACAACCTCA GCCATGTACG CGCGCGCGCA 
17 01 CGCGGTGTGC TCATCTCTCG AGTTAATTTG GTTGTTGTTG TTGTTGTGTT 
17 51 CTTGTGACAT CTCAATTAAC ATCCGATCGT GGTCGATCGA TCGCCCTGTG 
1601 GTGGC G AT AC TCCTTGCATT GCAGTGCTTC CGTAGGCGGC AG AAG AAGGC 
165 1 CATCXSGCGAC GCCCTCC-ACA AGATCAAAGC AGGTCACCCT CAGCCTCAGC 



WO 98/04586 PCT/GB97/02046 

15/28 

FIGURE 8 cont ' d 

1901 TCACCCTCAG CCTCCATCTC TAAATATTTG ACGCCGTTGA CTTTTTTAAA 
1951 TATGTTTGAC CATTCGTCTT ATTTAAAAAA TTTAAGTAAT TATTAATTCT 
2001 TTTTCTACCA TTTGATTCAT TGCTAAATAT ACTATTATGT ATACATATAG 
2051 TTTTACATAT TTCACTAAAG TTTTTAAATA AGACGAATGG TCAAACATGT 
2101 TTAAAAAAGT CAACGGCGTC AAACATTTAG GAAGAAGAGA ATATTATATT 
2151 GCTGCTCCCC TCTAGCCACT TTGCTGCCTC CCTCGTCATT TTTTCAAGTA 
2201 TTTTACGCAA GACTGGTCCT CCAAATCAAA CGTCACAAAT AAGCCATTTA 
2251 TAGTTTCCTT TCGCTTTTTA AGGGGGACTA CTTGTATTTA ATCATGGAGG 
2 3 01 AAACTACCAG TCGGATGTCC GATTACTTAA AAAAAAATTC GGGGG AC TAA 
23 51 TTTTTTTGGC TGATCATCGG TGAAATATTA GGTTATATAT GTTGAAAAAA 
2401 AATCAGCCAC AAACAATGAA ATATTTTGTG AAACACATAT TAGACACGTT 
2451 GAAACGTATC ATTGTTACGT ATAAAACATC GAATGTTAAC AGATTAAAAC 
2501 ATATGTTTTT TTTTAATCAG AATATAATCA TGCGATATAT TATTGTAAAG 
2551 ATATAATTAC AACGAATACA ACAGTGCGAT CGGATTATAT ATATATTAGT 
2601 AGTTTAAGAG AAAAATCATT TTGAAGATTA CTAGATACAT ACACGTATAG 
2 651 ATGGATGAAG TGGAGAGAGA TTAGAGATAA GTAGTTATAT GAATTTTGTG 
2701 AAACACACTT AAGACATATG TTCAAACATA CTGCTATTAT GTATGAAATA 
2751 TTGAGTTTTA ACGGTTTAAA ACACATATTC TTTTAATTAG AATGTAATAA 
2 801 TGTGATATCT TGTTGTAAAA TTTAATTACA TCTAATATAA CGGTGTGATT 
2 851 AGATTGTATG TTGGATAACA TGCCCATCGG TTGGCTTATT TAGGGAATAA 
2901 GCCAAATGGT ATATTTGCAA ACGAAAAATA ATTTGTAAAT AAAACTTTTA 
2951 TGTATGTATT CTTAACGATC TAGCAGCAAA GGCTGAAAAA TAAACTTCGA 
3001 TGAAAAATCT CAAAATCAAC TCTTAAAATT TAAATTTTGG CTTATAAGTA 
3051 TAGTTCCTAA CTAGTTTAGA AGAAAAAATA TTTAAAGCGG GGAAGAGGAA 
3101 AAGGAATAAA CTAATAGCTA AATTATTGCA TGCATGTAGC GATTTGAGGA 
3151 CGACCGAGTT GTTTTGTCTG GATCAGCCGA CCGAGACAGA GCAATCTTCT 
3201 TTAATCATAA ATAACCAGAA AAACCATACC AGTTCATCAC AATGGACCGA 
3251 GTCAGAGTCA TTACATATTT TTCATTGTTG CGCACAGGAT TCACCATGTT 
3301 CTTATGGGAA ATATTTTTAA CTCTCAAATG GTTATGATTT TGAACTCTCA 
3351 TTTTTGAGAG AGAATTAACA AGCGAGCGAG CAATCAGGCC AAAAAGGGAG 
3401 AAAGAAAATT ATTTTTGTTA ATTTTTTTTT AAGGTAGGGT GGAGGAGTCA 
3451 TTACATGATT TTTTTTTATA TTCCCTCGTT GATTATATGC TGTTCAAATG 
3 501 GTTATGATTT TTTTAAAAGA TAACAACAAT ACAAATTAGT ATGTGATAGA 
3 551 TCATTTCACG AGCATATAGG ATTAAATTTA ACTTCTGTAA ATTACAAAAC 
3601 AAACAAGTTT AACTGTTAAT ATACATTAAA TTTGTTTTTT TCAACTTAGG 
3 651 AATTGAATTT TATGTATATA TTTGTAAAAT GATATATTAA TTTATTTTTT 
3701 TAAAAAAATA ATTATTTAGA TAACACGCAA ACTAGAAAAC CACCGCAGAA 
3751 GTTCTCATAT TTCTTGTCCT ATCTGCACTT GCAGAGCTGA TGCTGCTGGG 
3801 CTTCATATCC CTGCTTCTCA CCGTGGCACA GGCGCCC ATC TCCAAGATCT 
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3851 GCATCCCCAA GTCGGCTGCC AACATCTTGT TGCCGTGCAA GGCAGGCCAA 
3901 GATGCCATCG AAGAAAGAAG CAGCAAGTGG TCGCCGGTCC TTGGCCGGCG 
3951 CCGGCGGCGG GGACTACTGC TCGAAATTCG ATGTGAGAAT AACACCAGCT 
4001 GCCGGCAAGC ACAACCTCGA TGCAATAACT AATTTAACTA TAATTGATTT 
4051 TTCTTGGGTT TTCTGCAGGG CAAGGTGGCG CTGATGTCGG CAAAGAGCAT 
4101 GCACCAGCTG CACATTTTCA TCTTCGTGCT CGCCGTGTTC CATGTTACCT 
4151 ACTGCATCAT CACCATGGGT TTAGGGCGCC TCAAAGTGAG TTTGTCGTTC 
42 01 TGTCCCTCAT GCACATGTTT TCTCTAGTTC TAGCAAGATT GTCAGTCCTT 
4251 CAAATGGATT GTTTCGACAA GAAACCCAAT TTATTAATTT GCCAGTAAAT 
4301 ATATAATAAT TGATCTTTCT TGGTTTTAGA TGAAGAAATG GAAGAAGTGG 
4351 GAGTCACAGA CCAACTCATT GGAGTATCAG TTCGCAATCG GTAGTGAATT 
4401 AAGAATCTCC CTAACTATTT CATTTCAGAA CCTTTATGAT AATGTCTTGA 
4451 AAGAGGAGGA GCAAATCAGC TGAAAAATAT GATCGATCCA TGCAGATCCT 
4501 TCACGATTCA GGTTCACGCA TCAGACGTCG TTCGTGAAGC GGCATCTGGG 
4 551 ATCATTCTCA AGCACCCCTG GGCTCAGATG GATCGTGAGT TATCAATCTC 
4601 CGAATACATG CTTGTTTTTT ATTCTTGCAA CTGGCCTAGC TGTTCCAATT 
4651 CAATCCATAT TTTTTGAAAA AAAAAATATT CATGCCGTGT TTGTTGTTAG 
4 701 GTAGCATTCT TCAGGCAGTT CTTTGGGTCC GTCACCAAGG TGGACTACCT 
4751 GACCATGCGG CAAGGCTTCA TCAATGTATA TACTAATCAA ACCTGACCAA 
4 801 TTCAACATTG ATGATGCAAA CAGAGACCAG GTTTTTTTTT TCGAGTGTGC 
4851 ATTGAGTAAT GGTTTTAGCT TCTTCTCTTT TGCAGGCGCA TTTGTCGCAG 
4901 AATAGCAAGT TCGACTTCCA CAAATACATC AAGAGGTCTT TGGAGGACGA 
4951 CTTCAAAGTT GTCGTTGGCA TCAGGTCCGT CCTCGCTTTA TTAATTATAG 
5001 GACTCTTATA TTCAACATTT TTTTTATAAA GAAACATATT TAGTCTCCAG 
5051 TTGTGTATGT GTATCTGGAT CTTGACACAT TTGGCTGGTT TTGCAGCCTC 
5101 CCTCTCTGGT TCGTCGG AAT CCTTCTACTC TTCCTCGATA TCCACGGTAA 
5151 TCCTTGTCCT ATTTCATTCT TTTTTTTACT CTCAAAACCT TGTTCTGAAT 
5201 TGGTCTTATA ATCACCATCG ATTTTTTTTC AACTTTTTCC CCGCGTGTAG 
5251 GTCTTGGCAC ACTTATTTGG ATCTCTTTTG TTCCTCTCAT CGTAAGAGCG 
5301 AAATTTCCCT GTCCAAAGAA ACAGTTAACA TAATTAATTA TGCTTTAATT 
5351 TATCATGAAA ATTAATATGA TCATATAACT AATGAACAAA CATTCATGTG 
5401 AATGCCACCG TTGTCTCAGA TCGTCTTGTT AGTTGGGACC AAGCTAGAGA 
5451 TGGTGATCAT GGAGATGGCC CAAGAGATAC AGGACAGGGC CACTGTGATC 
5501 CAGGGAGCAC CTATGGTTGA ACCAAGCAAC AAGTACTTCT GGTTCAACCG 
5551 CCCTGACTGG GTCTTGTTCT TCATACACCT GACACTCTTC CATGTACATG 
5601 TTTAAAACCT AAACCTTGCT GCTCAACTAC AAATAGTACT TTATCTTTCA 
5651 CAATTAACAC CTAATTAACT AACATAGCAT CCATCCATTT GTGGCTACTG 
5701 ATCGATGGGA CGACGGATCG ATCATCACCA GAACGCATTT CAGATGGCGC 
5751 ATTTCGTATG GACTATGGTG TGTATGCTAC TTGCTTAGTT GTTGCCATTA 
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CTCCTCCCTC 


TTCCTACCAA 


ACACAGTCTC 


1201 


ATCCAAACAT 


GTAACAACAC 


ATGCATGACC 


ACCAAACAAC 


TGAAGATGAA ; 


1251 


TGTATTCATC 


ATGTCTATAC 


TTACCATGCA 


TCAACAGGGA 


ACAACTATGC 


1301 


TAGGGTGAGA 


ACAGCTGCCA 


AACACACCCG 


TGCACCTACT 


CATGCTGTGC 


1351 


CGGCGCTGGC 


GTACGTGTGC 


AGTGGTTCCA 


CAAGTGGCGC 


AAGAAGGCCC 


1401 


TGGGGGAGGC 


GCTGGAGAAG 


ATGAAGGCGG 


AGCTCATGCT 


GGTGGGCTTC 


1451 


ATATCCCTGC 


TCCTCATCGT 


CACGCAGGAT 


CCCGTCTCCA 


GGATCTGCAT 


1501 


CTCCAAGGAG 


GCCGGCGAGA 


AGATGCTCCC 


GTGCAAGCCT 


TACGACGGCG 


1551 


CCGGCGGTGG 


CAAAGGCAAG 


GACAATCACC 


GGAGGCTTCT 


CTGGCTCCAA 


1601 


GGCGAGAGCG 


AGACCCACCG 


CCGGTTCCTG 


GCTGCCCCGG 


CCGGAGTGGA 


1651 


CGTCTGCGCC 


AAACAGGTGA 


GCACCTAGCG 


TCGCCACAAA 


CCACAAACTA 


1701 


GCTAATGAGC 


ATGGACCTGA 


ATTTCTTCTC 


TTCTTGGCTT 


GGCTTGACTA 


1751 


AATTGGTTGT 


GCAGGGCAAG 


GTGGCGCTGA 


TGTCAGCGGG 


AAGCATGCAC 


1801 


CAACTGCACA 


TATTCATCTT 


CGTGCTCGCC 


GTCTTCCACG 


TCTTGTACAG 


1851 


CGTCGTCACC 


ATGACCCTAA 


GCCGTCTCAA 


AGTGAGCATC 


ATACTCGAGC 
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1901 


TGTTTGTCAA 


TAATCCTTGG 


1951 


TCCTGCTCCG 


GCTTCCTGCA 


2001 


GACCGCCTCG 


CTGGAGTATC 


2051 


TACTGAAACC 


GGATGCATTT 


2101 


AGTGTGAGCC 


GATACGTTGA 


2151 


TGCCGGTTCA 


CGCACCAGAC 


2201 


CAGCACCCCC 


GGCGTCAGAT 


2251 


CGTCGGTGAC 


CAAGGTGGAC 


2301 


GCGCATCTCT 


CGCAGGGCAA 


2351 


GTCGTTGGAG 


GACGACTTCA 


2401 


TCCTTTCTCT 


GCACAAATTA 


2451 


AGATCGATAG 


ATAGATTATA 


2501 


GATCTTGACA 


CATCTGTCCT 


2551 


CGGTCCTCAT 


CCTCTTCCTT 


2601 


CTGCTCGCCC 


TCTCCTCCGC 


2651 


ATATAACCAC 


ACCACCGTCG 


2701 


GATGTCCGTG 


GTTCCTCTCG 


2751 


GTCCAATTGT 


GATGTACAGT 


2801 


TGATGTCTTG 


CGTGTAACAT 


2851 


GAGATGGTGA 


TCATGGAGAT 


2901 


CGTCAAGGGT 


GCTCCCGCCG 


2951 


APPGGPCTGA 


PTGGGTCCTC 


3001 

J v U 1 




x vtvj v-. x xxx 


3051 

•J \J -J X 


TPAPTTTAPT 


TARfiPTAArT 


3101 

.J X w X 


V^uL.VJ X O X VJ X X 


TfiTinriTATGT 

XVJ\j\3\JXAiVJ X 


3151 




f2PAA APflAAP 


3?m 

J ^ V X 






3251 


PPTTOAAf^lAA 


ATf^PTAPPAP 


3301 




TAfSPPfSPPPA 


3351 




GTPAPGCAGA 


3401 


APHAGPAC5AP 


GGPPAAGGPG 


3451 


AAGAAGAAGG 




3501 


CGCGACGCCG 


AGCGTCGGCT 


3551 


GGGCGCGGTC 


CGACGACCCC 


3601 


AAGGAAGGCG 


GCGGCGTGCA 


3651 


CGGGTGGAGG 


TCGGCCTCGT 


3701 


CAGATTTTGG 


CTTCAGCACG 


3751 


TTTTTCGGTG 


AATAGAAGCG 


3801 


CAGGAATGGC 


TGTCCTACTA 



TTTCCAATCC AATTCCAAAG CTGGCACTGA 
GATGAAGCAA TGGAAGAAGT GGGAGTCGGA 
AGTTCGCGAA TGGTCAGCTT CAACTTTTCT 
ACAACAAACG CACGCACGAT CAATCATCAC 
ACCGATTGAA TCCTCGCAGA TCCATCGCGG 
GACGTTGGTG AGGCGGCACC TGGGCCTCTC 
GGGTGGTGGC CTTCTTCAGG GAGTTCTTCA 
TACCTGACCT TGCGGCAGGG CTTCATCAAC 
CAGGTTCGAC TTCCACAAGT ACATCAAGAG 
AAGTCGTCGT CCGCATCAGG TACGCGCCAT 
ATACATCCAC . CACCACATAG GTAGATAGAT 
CAAGTGCCGG TACGTACGTA CGTCTCATAT 
CTTGCCGCAG TCTCAAGCTC TGGTTCGTGG 
GATTTCGACG GTAGCCGCCT TGTCCATGCC 
TTCTCTCCAT AATTTGTGAA CTTGTCCCGT 
TCTTCTCGCA GGGATCGGCA CTCTTCTCTG 
TGGTAAGTCC ACAATTTGAA TAGACAACCT 
ACCTCCAAAC TTAATTAACA TGTCATTTGC 
TAGATCCTCT TGTGGGTTGG GACCAAGCTG 
GGCCCAGGAG ATCCATGACC GGGAGAGCGT 
TCGAGCCCAG CAACAAGTAC TTCTGGTTCA 
TTCCTCATGC ACCTCACACT CTTCCAGAAC 
CGTGTGGACA GTGGTACGTA CAAGTACTTG 
CCAACAAACG ACCCCAAATT AATGGTCCGT 
TTGGGGTAAA CGGACACAAA ACTCAATCCA 
GTTTTTCCGT ACGTTTTCGT CCGCTTTCGC 
TTGACGTTGT TGCATCGCAG GCCACGCCCG 
GAGAAAATGG CAATGAGCAT CGCCAAGGTC 
GATCTTGTGC AGCTACATCA CCTTCCCGCT 
TGGGCTCACA CATGAAGAGA AGCATCTTCG 
CTGACCAACT GGCGAAAGAT GGCCAAGGAG 
GGCCATGCTG ATGGCGCAGA TGGGCGGCGG 
CGTCGCCGGT GCACCTGCTC CACAAGGCCG 
CAGAGCGTGC CGGCGTCCCC GAGGGCCGAG 
GCATCCGGCG CGCAAGGTAC CTCCTTGTGA 
CGCCGGCGCT CGACGCTCAC ATCCCCGGTG 
CAACGTTGAC CGATCAGACA AGTTCCTTTT 
TATCATTTCA TTGATAGACA GTAGAAATTA 
CTATGTACAC AAGGGCACAG CAAAGGATCA 
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Figure 9 cont'd 



3 851 TTGATCTTGT TACAAGAGCA GTAGAAAGGG ATTGCTCTCC ATTGATCTTG 

3901 TTAAGTTGTA TGTCACAAAT TGTTGCAGAA AAAAGTGTAT GTCATCCCAA 

3 951 CCAAGAGCTG AGTTTGTGAT GATTCGTGCA ATAAGAATTG CAAGTTTCAC 
4001 CGAGTCAAAA ATGAAGCTTC T AAGT ACG C A CCAACCAACG GACTCTTTCA 

4 051 TCTCAACAAA AGAACTGTAA ATGGCAATAA TTCTGATAAC ATCGGAAGGG 
4101 AGCTC 
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Figure 1 0 



1 


ATCGCAGGTG 


GGAGATCGGC 


51 


GGCGCTIX5GCC 


GTCGTCTGCG 


101 


AGCACGGCC? 


CCACAACCTC 


1S1 


TTTCTTCTCC 


TATCTCCACT 


201 


CCTGCTTCTC 


ACCGTGGCAC 


251 


AGTCGGCTGC 


CAACATCTTG 


301 


GAAGAAGAAG 


CAGCAAGTGG 


351 


GGACTACTGC 


TCGAAATTCG 


401 


GCATGCACCA 


GCTGCACATT 


451 


ACCTACTGCA 


TCATCACCAT 


501 


GAAGAAGTGG 


GAGTC ACAGA 


SSI 


ATCCTTCACG 


ATTCA.GGTTC 


601 


CTGGGATCAT 


TCTCAAGCAC 


651 


CAGGCAGTTC 


TTTGGGTCCG 


701 


AAGGCTTCAT 


CAATGCGCAT 


751 


AAATA C ATCA 


AGAGGTCTTT 


801 


CAGCCTCCCT 


CTGTGGTTCG 


SSI 


ACGGTCTTGG 


CAC AC TT ATT 


901 


TTGTTAGTTG 


GGACCAAGCT 


951 


G AT AC AGG AC 


AG G<^C CAC T G 


1001 


GCAACAAGTA 


CTTCTGGTTC 


10 51 


CACCTGACAC 


T C T TC CAT AA 


1101 


TATGGCAACA 


C C TGG T CTGA 


1151 


GCATCGTGGA 


AGTCATTGTG 


1201 


ATCACCTTCC 


CGCTCTACGC 


1251 


GAAGACAATT 


TTCGAGGACC 


1303 


AGAAGCCGAT 


GGAGAAGAAG 


1351 


CAGATGAGCG 


TCGACTTCGC 


1401 


GGTGCACCTG 


CTGCAGGTCA 


1 0S1 


TCACGGTGGC 


CTCACCACCG 


1501 


GCGGCGGCTG 


CGTCTCGCCA 


1SS1 


GATGGCATCC 


TCGTCGGCCG 


1601 


CACAACGGTG 


A 
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ATCGCGGGAG 


TTGCCGGAGA 


CGCCGACGTG 


CCGTCCTCCT 


GCTCGTCTCC 


GCCGCCATGG 


AGCCATAAAA 


CCACCGC*G* 


i> G' 1 ~ T *C' r CA T A 


TGCAGAGCTG 


A.TGCTGCTGG 


GCTTCAVATC 


AGGCGCCCAT 


CTCCAAGATC 


TGCATCCCCA 


TTGCCGTGCA 


AGGCAGGCCA 


AGATGCCATC 


TCGCCGGTCC 


TTGGCCGGCG 


CCGGCGGCGG 


ATGGCAAGGT 


GGCGCTGATG 


TCGGCAAAGA 


TTCATCTTCG 


TGCTCGCCGT 


GTTCCATGTT 


GGGTTTAGGC- 


CGCCTCAAAA 


TG AAG AAATG 


CCAACTCATT 


GGAGTATCAG 


TTCGCAATCG 


ACGCATCAGA 


CGTCGTTCGT 


GAAGCGGCAT 


CCCTGGGCTC 


AGATGGATCG 


TAGCATTCTT 


TCACCAAGGT 


GGACTACCTG 


ACCATGCGGC 


TTGTCGCAGA 


ATAGCAAGTT 


CGACTTCCAC 


GGA.GGACGAC 


TTCAAAGTTG 


TCGTTGGCAT 


TCGGAATCCT 


TGTA.CTCTTC 


CTCGATATCC 


TGGATCTCTT 


TTGTTCCTCT 


C A.TC ATCGTC 


ACAGATGGTG 


ATCATGGAGA 


TGG C CCAAGA 


TGATCCAGGG 


A.GCACCTATG 


GTTGAACCAA 


AACCGCCCTG 


ACTGGGTCTT 


G TTTTTCAT A 


CGCATTTCAG * 


ATGGCGCATT 


TCGTATGGAC 


AC-AAATGCTT 


CCATGAAAAT 


ATTTGGCTGA 


GGGATCTCTC 


TTCAGGTGCT 


ATGCAGCTAC 


GCTCCTCACA 


CAGATGGCAT 


CGAACATGAA 


AAACGATGAA 


GGCGCTGATG 


AACTGGAGGA 


AAGGTCCGGG 


ACGCCGACGC 


GTTCCTGGCG 


GACGCCGGCG 


TCGAGCCGGT 


CCGCC-TCGCC 


CAGGGCGGGT 


CGGACGCCCG 


CCGAGCCCAA 


GCACCGGAGC- 


AGGACATGTA 


CCCGGTGCCC 


GC TG C TAG AC 


GACCCGCCGG 


ACAGGAGGTG 


ACATCGCCGA 


TTCTGATTTT 


TCCTTC AGCG 
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Figure f 1 



1 


ATGGCTGGGC 


51 


GG1 XJGCGGTA 


101 


ACGCGCTCCA 


151 


CTGGGGGAGG 


201 


CATATCCCTG 


251 


TCTCCAAGGA 


301 


GCCGGCGGTG 


351 


ACGCGAGAGC 


401 


ACGTCTGCGC 


451 


CACCAACTGC 


501 


CAGCGTCGTC 


551 


AGTGGGJ>_GTC 


601 


TCGCGGTGCC 


651 


CCTCTCCAGC 


701 


TCTTCACGTC 


751 


ATC AACGCGC 


801 


CAAGAGGTCG 


851 


AGCTCTGGTT 


901 


GGC AC TCTTC 


951 


TGGGACCAAG 


1001 


AC C GG GAG AG 


1051 


TACTTCTGGT 


1101 


ACTCTTCCAG 


1151 


CCCCCGGCTT 


1201 


AAGGTCGTGC 


1251 


CCCGCTCTAC 


1301 


TCTTCGACGA 


1351 


AAGGAGAAGA 


1401 


CGGCCGCGCG 


1451 


AGGC:CGGGGC 


1501 


GCCGAGAACG 


1S51 


TTGTGACGCG 


1601 


CCGC:TGCAGA 



CGGCGQGP.GG TCGGGAGCTG 
GTC TGCGCCG TCATGATACT 
CAAGCTCGGC CACTGGTTCC 
CGCTGGAGAA GATGAAGGCG 
CTCCTCATCG TCACCCAGGA 
GGCCGGCGAG AAGATGCTCC 
GCAAAGGCAA GGACAATCAC 
GAGACCCACC GCCCGTTCCT 
CAAACAGGGC AAGGTGGCGC 
AC AT ATTC AT CTTCGTGCTC 
ACCATGACCC TAAGCCGTCT 
GGAGACCGCC TCCCTGGAGT 
GGTTCACGCA CCAGACGACG 
ACCCCCGGCG TCAGATGGGT 
GGTGACCAAG GTCGACTACC 
ATCTCTCGCA CGGCAACAGG 
TTGGAGGACG ACTTCAAAGT 
CGTGGCGGTC CTCATCCTCT 
TCTGGATGTC CGTGGTTCCT 
CTGGAGATCG TGATCATGGA 
CGTCGTCAAG GGTGCTCCCG 
TCAACCGGCC TGACTGGGTC 
AAC G C GTTTC AGATGGCTCA 
G AA G AAATGC T ACC ACG AG A 
TGGGGGTAGC CGCCCAGATC 
GCGCTCGTCA CGCAGATGGG 
GCAGACGGCC AAGGCGCTGA 
AGAAGGCCCG AGACGCGGCC 
ACGCCGAGCG TCGCCTCGTC 
GCGCTCCGAC GACCCCCAGA 
AAGGCGGCGG CGTGCAGCAT 
TGGAGGTCGG CCTCGTCGCC 
TTTTCGCTTC AGCACGCAAC 



TCGGACACGC CGACGTGGGC 

CGTCTCCGTC GCCATGGACC 

ACAACTGGCG CAAGAAGGCC 

GAGCTCATGC TGCTGGGCTT 

TCCCGTCTCC AGGATCTGCA 

CGTGCAAGCC TTACGACGGC 

CGG AGGCTTC TCTGGCTCCA 

GGCTGCCCCC GCCGGAGTGG 

TGATGTCAGC GGCAAGCATG 

o - 

GCCGTCTTCC ACGTCTTGTA 

CAiAAATGAAG CAATGGAAGA 

ATCAGTTCGC GAATGATCCA 

TTGGTGAGGC GGCACCTGGG 

GGTGCCCTTC TTCAGGCACT 

TGACCTTGCG GCAGGGCTTC 

TTCGACTTCC ACAAGTACAT 

CGTCGTCCGC ATCAGTCTCA 

TCCTTGATTT CGACGGGATC 

CTCGTGATCC TCTTGTGGGT 

GATGGCCCAG GAGATCCATG 

CCGTCGAGCC CAGCAACAAG 

CTCTTCCTCA TGCACCTCAC 

TTTCGTGTGG ACAGTGGCCA 

AAATGGCAAT GAGCATCGCC 

TTGTGCAGCT ACATCACCTT 

CTCACACATC AAGAGAAGCA 

CCAACTGGCG AAAGATGGCC 

ATGCTGATGG CGCAGATGGG 

GCCGGTGCAC CTGCTCCACA 

GCGTGCCGGC GTCCCCGAGG 

CCGGCGCGCA ACGTACCTCC 

GGCGCTCGAC GCTCACATCC 
GTTCA 
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1 GTTGGTACAT AAAAGACTCT TCCTTTGTCT GTTTTTTGTT CCCAGATTCA 

51 TCTTTACTTA TTGACTAAAT TCTCTCTGGT GTGAGAAGTA AAATGGGTCA 

101 CGGAGGAGAA GGGATGTCGC TTGAATTCAC TCCGACGTGG GTCGTCGCCG 

151 GAGTTTGTAC GGTCATCGTC GCGATTTCAC TGGCGGTGGA GCGTTTGCTT 

201 CACTATTTCG GTACTGTTCT TAAGAAGAAG AAGCAAAAAC CCCTTTACGA 

251 AGCCCTTCAA AAGGTTAAAG AAGAGCTGAT GTTGTTAGGG TTTATATCGC 

301 TGTTACTGAC GGTATTCCAA GGGCTCATTT CCAAATTCTG TGTGAAAGAA 

351 AATGTGCTTA TGCATATGCT TCCATGTTCT CTCGATTCAA GACGAGAAGC 

401 TGGGGCAAGT GAACATAAAA ACGTTACAGC AAAAGAACAT TTTCAGACTT 

451 TTTTACCTAT TGTTGGAACC ACTAGGCGTC TACTTGCTGA ACATGCTGCT 

501 GTGCAAGTTG GTTACTGTAG CGAAAAGGGT AAAGTACCAT TGCTTTCGCT 

551 TGAGGCATTG CACCATCTAC ATATTTTCAT CTTCGTCCTC GCCATATCCC 

601 ATGTGACATT CTGTGTCCTT ACCGTGATTT TTGGAAGCAC AAGGATTCAC 

651 CAATGGAAGA AATGGGAGGA TTCGATCGCA GATGAGAAGT TTGACCCCGA 

701 AACAGCTCTC AGGAAAAGAA GGGTCACTCA TGTACACAAC CATGCTTTTA 

751 TTAAAGAGCA TTTTCTTGGT ATTGGCAAAG ATTCAGTCAT CCTCGGATGG 

801 ACGCAATCCT TTCTCAAGCA ATTCTATGAT TCTGTGACGA AATCAGATTA 

851 CGTGACTTTA CGTCTTGGTT TCATTATGAC ACATTGTAAG GGAAACCCCA 

901 AGCTTAATTT CCACAAGTAT ATGATGCGCG CTCTAGAGGA TGATTTCAAA 

951 CAAGTTGTTG GTATTAGTTG GTATCTTTGG ATCTTTGTCG TCATCTTTTT 

1001 GCTGCTAAAT GTTAACGGAT GGCACACATA TTTCTGGATA GCATTTATTC 

1051 CCTTTGCTTT GCTTCTTGCT GTGGGAACAA AGTTGGAGCA TGTGATTGCA 

1101 CAGTTAGCTC ATGAAGTTGC AGAGAAACAT GTAGCCATTG AAGGAGACTT 

1151 AGTGGTGAAA CCCTCAGATG AGCATTTCTG GTTCAGCAAA CCTCAAATTG 

1201 TTCTCTACTT GATCCATTTT ATCCTCTTCC AGAATGCTTT TGAGATTGCG 

1251 TTTTTCTTTT GGATTTGGGT TACATACGGC TTCGACTCGT GCATTATGGG 

1301 ACAGGTGAGA TACATTGTTC CAAGATTGGT TATCGGGGTC TTCATTCAAG 

1351 TGCTTTGCAG TTACAGTACA CTGCCTCTTT ACGCCATCGT CTCACAGATG 

1401 GGAAGTAGCT TCAAGAAAGC TATATTCGAG GAGAATGTGC AGGTTGGTCT 

1451 TGTTNGTTGG GCACAGAAAG TGAAACAAAA GAGAGACCTA AAAGCTGCAG 

1501 CTAGTAATGG AGACGAAGGA AGCTCTCAGG CTGGTCCTGG TCCTGATTCT 

1551 GGTTCTGGTT CTGCTCCTGC TGCTGGTCCT GGTGCAGGTT TTGCAGGAAT 

1601 TCAGCTCAGC AGAGTAACAA GAAACAACGC AGGGGACACA AACAATGAGA 

1651 TTACACCTGA TCATAACAAC TGAGCAGAGA TATTATCTTT TCCATTTAGA 

1701 GGATCATCAT CAGATTTTAG CTTCAAGGTC CGGTTTTGTG GTTTATACAT 

1751 AAGTTATAGT GACTTGATTT TTTTGTTTTG TTACAAAGTT ACCATCTTTG 

1801 GATTAGAATT GGGAAATTGA ATCTGTTTGT ATATTGTATT ATTTGGAACA 

1851 TTGTGGATGC CCATGGATAT GTTTCTGTTC 
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1 MAGGRSGSRE LPETPTWAVA WCAVLVLVS AAMEHGLHNL SHKTTAEVLI 

51 FLVLSALAEL MLU3FISLLL TVAQAPISKI CIPKSAANIL LPCKAGQDAI 

101 EEEAASGRRS IiAGAGGGDYC . SKFDGKVALM SAKSMHQLHI FIFVLAVFHV 

151 TYCIITMGLG RLKMKKWKKW ESQTNSLEYQ FAIDPSRFRF THQTSFVKRH 

2 01 U3SFSSTPGL RWIVAFFRQF FGSVTKVDYL TMRQGFINAH LSQNSKFDFH 

251 KYIKRSLEDD FKVWGISLP LWFVGILVLF LDIHGLGTLI WISFVPLIIV 

301 LLVGTKLEMV IMEMAQEIQD RATVIQGAPM VEPSNKYFWF NRPDWVLFFI 

351 HLTLFHNAFQ MAHFVWTMAT PGLKKCFHEN IWLSIVEVIV GISLQVLCSY 

4 01 ITFPIiYALVT QMGSNMKKTI FEEQTMKALM NWRKKAMEKK KVRDADAFLA 

451 QMSVDFATPA SSRSASPVHL LQVTGRVGRP PSPITVASPP APEEDMYPVP 

501 AAAASRQLLD DPPDRRWMAS SSADIADSDF SFSAQR* 
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1 MAGPAGGREL SDTPTWAVAV VCAVMILVSV AMEHALHKLG HWFHKWRKKA 

51 LGEALEKMKA ELMLVGFISL LLIVTQDPVS RICISKEAGE KMLPCKPYDG 

101 AGGGKGKDNH RRLLWLQGES ETHRRFLAAP AGVDVCAKQG KVALMSAGSM 

151 HQLHIFIFVL AVFHVLYSW TMTLSRLKMK QWKKWESETA SLEYQFANDP 

201 SRCRFTHQTT LVRRHLGLSS TPGVRWWAF FRQFFTSVTK VDYLTLRQGF 

251 INAHLSQGNR FDFHKYIKRS LEDDFKWVR ISLKLWFVAV LILFLDFDGI 

3 01 GTLLWMSWP LVILLWVGTK LEMVIMEMAQ EIHDRESWK GAPAVEPSNK 

3 51 YFWFNRPDWV LFLMHLTLFQ NAFQMAHFVW TVATPGLKKC YHEKMAMSIA 

4 01 KWLGVAAQI LCSYITFPLY ALVTQMGSHM KRSIFDEQTA KALTNWRKMA 
451 KEKKKARDAA MLMAQMGGGA TPSVGSSPVH LLHKAGARSD DPQSVPASPR 
501 AEKEGGGVQH PARKVPPCDG WRSASSPALD AHIPGADFGF STQR* 
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Figure 1 5 



1 


MGHGGSGHSL 


E FT PTWWM 


VCTVIVAXSL 


AVERX.LHYFG 


TVLKKKKOICP 


51 


LYEMiQKVKE 


ELAfLLGFXSLi 


LLTVFQGLIS 


KFCV/KENVXiM 


HMLPCSLDSR 


101 


REAGASEHKN 


VTAKEHFQT? 


LPIVCTTRRI* 


LAEEIAAVQVG 


YCSEKGKVPL 


151 


LSLEALHHLH 


iFirvuo:sfi 


VTFCVXrTVIF 


GSTRIHQWKX 


WHDSIADEKF 


201 


DPETAL*RKRR 


VTIIVHNHAFX 


KEHFLGIGKD 


SVTLGVJTQSF 


LKQFYDSVTK 


251 


SDYVTLRLGF 


XMTHCKGNPK 


LNFHKYMMRA 


LEDDFKQWG 


ISWYDWIFW 


301 


IFLLiLNVNGW 


HTYFWXAFIP 


FALIiLAVGTK 


LEHVTAQIiAH 


EVAEKHUAXF.._ 


351 


GDLWKPSDE 


HFWFSKPQ1V 


LYLIHFILFQ 


NAFEXAFFFW 


IWVTYGFDSC 


401 


XMGOVRYXVP 


RLVXCVFIQV LCSYSTLPLY AXVSQMGSSF 


KKAILEENVQ 


451 


VGLVGWAQKV 


KQKRDL.KAAA 


SNGDEGSSQA 


GPGPDSGSCS 


APAAGPGAGr 


501 


AGIQLSRVTR 


NNAGDTNNEI 


TPDHNN* 
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FIGURE 16 (CONT/D) 
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